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The T2K collaboration reports evidence for electron neutrino appearance at the atmospheric mass 
splitting, |Am3 2 | « 2.4 x 10~ 3 eV 2 . An excess of electron neutrino interactions over background 
is observed from a muon neutrino beam with a peak energy of 0.6 GeV at the Super-Kamiokande 
(SK) detector 295 km from the beam's origin. Signal and background predictions are constrained 
by data from near detectors located 280 m from the neutrino production target. We observe 11 
electron neutrino candidate events at the SK detector when a background of 3.3 ± 0.4(syst.) events 
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is expected. The background-only hypothesis is rejected with a p- value of 0.0009 (3.1a), and a 
fit assuming — > v e oscillations with sin 2 2(?23=l, 5cp~0 and |Am3 2 | = 2.4 x 10~ 3 eV 2 yields 
sin 2 20i3=O.O88l^^(stat.+syst.). 

PACS numbers: 14.60.Pq,14.60.Lm,12.27.-a,29.40.ka 



INTRODUCTION 



the term (in units of c, H = 1): 



The phenomena of neutrino oscillations through the 
mixing of massive neutrinos have been well established 
by experiments observing neutrino interaction rates from 
solar [IH2], atmospheric [8HT3]. reactor [14] and accel- 
erator [I5HT8] sources. With few exceptions, such as 
the results from the LSND 1!) and MiniBooNE collab- 
orations [20]: the observations are consistent with the 
mixing of three neutrinos, governed by three mixing an- 
gles: #i2 ~ 34°, 6 2 3 ~ 45° and #i 3 ; and an as-yet- 
undetermined CP-violating phase, 5c p- Neutrino mix- 
ing also depends on three mass states, m.;, and there- 
fore two independent mass splittings, |Am§ 2 | ~ 2.4 x 
10~ 3 eV 2 (atmospheric) and Am^ « 7.6 x 10~ 5 eV 2 (so- 
lar), where Am,^ = m 2 — m 2 . Additional understand- 
ing of neutrino mixing can be gained by observing the 
appearance of one flavor of neutrino interactions in a 
beam of another flavor through charged current interac- 
tions. Recently, T2K [2T] has reported on the appearance 
of electron neutrinos in a beam of muon neutrinos, and 
the OPERA [22] and Super-Kamiokande [23] collabora- 
tions have reported on the appearance of tau neutrinos 
from accelerator-based and atmospheric muon neutrino 
sources, respectively. 

The oscillations of — > v e that T2K searches for are 
of particular interest since the observation of this mode 
at a baseline over energy ratio (L/E) of ~ 1 GeV/500 km 
implies a non-zero value for the mixing angle #13. Un- 
til recently, the mixing angle #13 had only been con- 
strained to be less than 11° by reactor 2 JJ and accel- 
erator [25 neutrino experiments. With data collected 
through 2011, the T2K experiment found the first indi- 
cation of non-zero #13 in the oscillation of muon neutrinos 
to electron neutrinos |21j . Since then, a non-zero value of 
#13 = 9.1° ±0.6° [21] has been confirmed from the disap- 
pearance of reactor electron anti-neutrinos observed by 
the Daya Bay [27], RENO [25] and Double Chooz [2J] ex- 
periments. In this paper, T2K updates its measurement 
of electron neutrino appearance using additional data col- 
lected through 2012 and improved analysis methods. 

The probability for electron neutrino appearance in a 
muon neutrino beam with energy E v of 0(1) GeV prop- 
agating over a baseline L of 0(100) km is dominated by 
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P„ M _>„. « sin 2 # 23 sin 2 2# 13 sin 2 ^^. (1) 

This leading term is identical for neutrino and antineu- 
trino oscillations. Since the probability depends on 
sin 2 #23, a precise determination of #13 requires measure- 
ments of #23- The dependence on sin 2 #23 can help resolve 
the eightfold ambiguity that arises when determining #23 
from muon neutrino disappearance, which depends on 
sin 2 2#23- The electron neutrino appearance probability 
also includes sub-leading terms which depend on Sep and 
terms that describe matter interactions 130 : 
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2\f2GpN e . Ev . 2 , where N e is the electron density of the 
Earth's crust. In the three- neutrino paradigm CP vio- 
lation can only occur when all three mixing angles, in- 
cluding #13, have non-zero values. The second term has a 
negative sign for neutrinos and a positive sign for antineu- 
trinos and violates CP, which suggests the possibility of 
observing CP violation by measuring the difference in the 
electron neutrino appearance probabilities for neutrinos 
and antineutrinos. Since the CP-violating term can only 
appear in an appearance probability, a measurement of v e 
appearance, such as the one described in this paper, is an 
important milestone towards future searches for CP vio- 
lation. The A dependence in the oscillation probability 
arises from matter effects and introduces a dependence 
on the sign of Am§ 2 . We refer to Am§ 2 > as the nor- 
mal mass hierarchy and Am§ 2 < as the inverted mass 
hierarchy. 

This paper is organized as follows. Section [IT] is a brief 
overview of the T2K experiment and the data-taking pe- 
riods. Section |III| summarizes the analysis method and 
components, including the flux (Section |lV[ ), neutrino in- 
teraction model (Section|v]) and near detector and far de- 
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tector data samples (Section VI and Section VIII respec- 
tively). The lit to near detector data, described in Sec- 
tion |VII| is used to constrain the far detector rate and as- 
sociated uncertainties. Finally, Section [TX| describes how 
the far detector v & sample is used to estimate sin 2 26>i 3 . 



II. EXPERIMENTAL OVERVIEW AND DATA 
COLLECTION 

The T2K experiment [3T] is optimized to observe elec- 
tron neutrino appearance in a muon neutrino beam. We 
sample a beam of muon neutrinos generated at the J- 
PARC accelerator facility in Tokai-mura, Japan, at base- 
lines of 280 m and 295 km from the neutrino production 
target. The T2K neutrino beam line accepts a 31 GeV/c 
proton beam from the J-PARC accelerator complex. The 
proton beam is delivered in 5 /its long spills with a period 
that has been decreased from 3.64 s to 2.56 s over the 
data-taking periods described in this paper. Each spill 
consists of 8 equally spaced bunches (a significant subset 
of the data was collected with 6 bunches per spill) that 
are ~ 15 ns wide. The protons strike a 91.4 cm long 
graphite target, producing hadrons including pions and 
kaons, and positively charged particles are focused by a 
series of three magnetic horns operating at 250 kA. The 
pions, kaons and some muons decay in a 96 m long vol- 
ume to produce a predominantly muon neutrino beam. 
The remaining protons and particles which have not de- 
cayed are stopped in a beam dump. A muon monitor 
situated downstream of the beam dump measures the 
profile of muons from hadron decay and monitors the 
beam direction and intensity. 

We detect neutrinos at both near (280 m from the tar- 
get) and far (295 km from the target) detectors. The far 
detector is the Super-Kamiokande (SK) water Cherenkov 
detector. The beam is aimed 2.5° (44 mrad) away from 
the target-to-SK axis to optimize the neutrino energy 
spectrum for the oscillation measurements. The off-axis 
configuration 32 34| takes advantage of the kinematics of 
pion decays to produce a narrow band beam. The angle 
is chosen so that the spectrum peaks at the first oscilla- 
tion maximum, as shown in Fig.[l] maximizing the signal 
in the oscillation region and minimizing feed-down back- 
grounds from high energy neutrino interactions. This 
optimization is possible because the value of |Ato§ 2 | is 
already relatively well known. 

The near detectors measure the properties of the beam 
at a baseline where oscillation effects are negligible. The 
on-axis INGRID detector [35] ES] consists of 16 mod- 
ules of interleaved scintillator/iron layers in a cross con- 
figuration centered on the nominal neutrino beam axis, 
covering ±5 m transverse to the beam direction along 
the horizontal and vertical axes. The INGRID detector 
monitors the neutrino event rate stability at each mod- 
ule, and the neutrino beam direction using the profile of 
event rates across the modules. 

The off-axis ND280 detector is a magnetized multi- 
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FIG. 1: The muon neutrino survival probability (top) 
and electron neutrino appearance probabilities (middle) 
at 295 km, and the unoscillated neutrino fluxes for 
different values of the off-axis angle (OA) (bottom). 
The appearance probability is shown for two values of 
the phase 8c p, and for normal (NH) and inverted (IH) 
mass hierarchies. 



purpose detector that is situated along the same di- 
rection as SK. It measures the neutrino beam compo- 
sition and energy spectrum prior to oscillations and is 
used to study neutrino interactions. The ND280 detec- 
tor utilizes a 0.2 T magnetic field generated by the re- 
furbished UAl/NOMAD magnet and consists of a num- 
ber of sub-detectors: side muon range detectors (SM- 
RDs [37]), electromagnetic calorimeters (ECALs), a tt° 
detector (POD 38J) and a tracking detector. The tracking 
detector is composed of two fine-grained scintillator bar 
detectors (FGDs [35]) sandwiched between three gaseous 
time projection chambers (TPCs gD]). The first FGD 
primarily consists of polystyrene scintillator and acts as 
the target for most of the near detector neutrino inter- 
actions that are treated in this paper. Hence, neutrino 
interactions in the first FGD are predominantly on car- 
bon nuclei. The ND280 detector is illustrated in Fig. [2] 
where the coordinate convention is also indicated. The 
x and z axes are in the horizontal plane, and the y axis 
is vertical. The origin is at the center of the magnet, 
and the magnetic field is along the x direction. The z 
axis is the direction to the far detector projected to the 
horizontal plane. 
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FIG. 3: An illustration of the SK detector. 



FIG. 2: An exploded illustration of the ND280 detector. 
The description of the component detectors can be 
found in the text. 



The SK far detector [41] , as illustrated in Fig. [3j is a 
50 kt water Cherenkov detector located in the Kamioka 
Observatory. The cylindrically-shaped water tank is op- 
tically separated to make two concentric detectors : an 
inner detector (ID) viewed by 11129 inward-looking 20 
inch photomultipliers, and an outer detector (OD) with 
1885 outward-facing 8 inch photomultipliers. The fidu- 
cial volume is defined to be a cylinder whose surface is 
2 m away from the ID wall, providing a fiducial mass 
of 22.5 kt. Cherenkov photons from charged particles 
produced in neutrino interactions form ring-shaped pat- 
terns on the detector walls, and are detected by the pho- 
tomultipliers. The ring topology can be used to iden- 
tify the type of particle and, for charged current inter- 
actions, the flavor of the neutrino that interacted. For 
example, electrons from electron neutrino interactions 
undergo large multiple scattering and induce electromag- 
netic showers, resulting in fuzzy ring patterns. In con- 
trast, the heavier muons from muon neutrino interactions 
produce Cherenkov rings with sharp edges. 

The T2K experiment uses a special software trigger to 
associate neutrino interactions in SK to neutrinos pro- 
duced in the T2K beam. The T2K trigger records all 
the photomultiplicr hits within ±500 /is of the beam ar- 
rival time at SK. Beam timing information is measured 
spill-by-spill at J-PARC and immediately passed to the 
online computing system at SK. The time synchroniza- 
tion between the two sites is done using the Global Po- 
sitioning System (GPS) with < 150 ns precision and is 
monitored with the Common- View method 02] . Spill 
events recorded by the T2K triggers are processed offline 
to apply the usual SK software triggers used to search 
for neutrino events, and any candidate events found are 
extracted for further T2K data analysis. Spills used for 



TABLE I: T2K data-taking periods and the integrated 
protons on target (POT) for SK data collected in those 
periods. 



Run Period 




Dates 




Integrated POT by SK 


Run 1 
Run 2 
Run 3 


Jan. 
Nov. 
Mar. 


2010- Jun. 
2010-Mar. 
2012- Jun. 


2010 
2011 
2012 


0.32 x 10 2U 
1.11 x 10 20 
1.58 x 10 20 



the far detector data analysis are selected by beam and 
SK quality cuts. The primary reason spills are rejected 
at SK is due to the requirement that there are no events 
in the 100 fis before the beam window, which is necessary 
to reject decay electrons from cosmic-ray muons. 

In this paper we present neutrino data collected during 
the three run periods listed in Table [T] The total SK data 
set corresponds to 3.01 x 10 20 protons on target (POT) 
or 4% of the T2K design exposure. About 50% of the 
data, the Run 3 data, were collected after T2K and J- 
PARC recovered from the 2011 Tohoku earthquake. A 
subset of data corresponding to 0.21 x 10 20 POT from 
Run 3 was collected with the magnetic horns operating 
at 205 kA instead of the nominal value of 250 kA. The size 
of the total data set is approximately two times that of 
T2K's previously published electron neutrino appearance 
result [2Pj . 

We monitor the rate and direction of the neutrino 
beam over the full data-taking period with the INGRID 
detector. As illustrated in Fig. [4] the POT-normalized 
neutrino event rate is stable to within 1%, and the beam 
direction is controlled well within the design requirement 
of 1 mrad, which corresponds to a 2% shift in the peak 
energy of the neutrino spectrum. 
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FIG. 4: The time dependence of the POT-normalized reconstructed neutrino event rate (a) and the beam direction 
(b) measured by INGRID. The error bars show the statistical uncertainty only. The points shown for the direction 
measurement include sequential data grouped in periods of stable beam conditions. 



III. ANALYSIS OVERVIEW 

We search for — > v e oscillations via charged current 
quasi-elastic (CCQE) interactions of v e at SK. Since the 
recoil proton from the target nucleus is typically below 
Cherenkov threshold, these events are characterized by a 
single electron-like ring and no other activity. The most 
significant background sources are intrinsic v e from muon 
and kaon decays, and neutral current ir° (NC7r°) events 
where the detector response to the photons from the tt° 
decay is consistent with a single electron-like ring. The 
selection of v e candidates is described in Section |VIH[ 

We estimate the oscillation parameters and produce 
confidence intervals using a model that describes the 
probabilities to observe v e candidate events at SK in 
bins of electron momentum (magnitude and direction), 
as described in Section [DC] The probabilities depend on 
the values of the oscillation parameters as well as many 
nuisance parameters that arise from uncertainties in neu- 
trino fluxes, neutrino interactions, and detector response. 
The point where the likelihood is maximum for the ob- 
served data sample gives the oscillation parameter esti- 
mates, and the likelihood ratio at other points is used to 
construct confidence intervals on the parameters. 

We model the neutrino flux with a data-driven simu- 
lation that takes as inputs measurements of the proton 
beam, hadron interactions and the horn fields [3H]- The 
uncertainties on the flux model parameters arise largely 
from the uncertainties on these measurements. The flux 
model and its uncertainties are described in Section HVl 

We model the interactions of neutrinos in the detec- 



tors assuming interactions on a quasi-free nucleon using a 
dipole parametrization for vector and axial form factors. 
The nuclei are treated as a relativistic Fermi gas, and 
outgoing hadrons are subject to interactions in the nu- 
cleus, so-called "final state interactions" . We validate the 
neutrino interaction model with comparisons to indepen- 
dent neutrino cross section measurements at 0(1) GeV 
and pion scattering data. We set the uncertainties on the 
interaction model with comparisons of the model to data 
and alternate models. The neutrino interaction model 
and its uncertainties are described in Section fVl 

We further constrain the flux and interaction model 
parameters with a fit to samples of neutrino interaction 
candidates in the ND280 detector. Selections containing 
a negative muon-like particle provide high purity sam- 
ples of interactions, which constrain both the flux 
that determines signal and NC7T backgrounds at SK, 
and the intrinsic v e flux. In the energy range of interest, 
the intrinsic v e are predominantly produced from the de- 
cay chain ir + — > /i + + v^, /i + — > e + + v e + D^, and to a 
lesser extent by three-body kaon decays. Hence, the v e 
flux is correlated with the flux through the production 
of pions and kaons in the T2K beam line. The charged 
current interactions that make up most of the ND280 
samples constrain the charged current interaction model. 
While v e interactions are indirectly constrained by z/„ 
interactions, we also include uncertainties which account 
for differences between the and v e cross section model. 
The ND280 neutrino interaction sample selection is de- 
scribed in Section |VI| and the fit of the neutrino flux 
and interaction models to this data is described in Sec- 
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IV. NEUTRINO FLUX MODEL 

We simulate the T2K beam line to calculate the neu- 
trino flux at the near and far detectors in the absence of 
neutrino oscillations, and this flux model is used as an 
input to predict neutrino interaction event rates at the 
detectors. 

The flux simulation begins with the primary proton 
beam upstream of the collimator that sits in front of 
the T2K target. The interactions of particles in the 
target, beam line components, decay volume walls and 
beam dump, and their decays, are simulated. The sim- 
ulation and its associated uncertainties are driven by 
measurements of the primary proton beam profile, mea- 
surements of the magnetic fields of the T2K horns, and 
hadron production data, including NA61/SHINE mea- 
surements j3HI3S]. First, we model the interactions of the 
primary beam protons and subsequently produced parti- 
cles in the graphite target with a FLUKA 2008 gl g7] 
simulation. We pass any particles that exit the tar- 
get into a GEANT3 [35] simulation that tracks parti- 
cles through the magnetic horns and decay region, and 
decays hadrons and muons to neutrinos. The hadron in- 
teractions in the GEANT3 simulation are modeled with 
GCALOR [49 . To improve agreement between selected 
hadron interaction measurements and the simulation, we 
weight simulated events based on the stored information 
of the hadron interactions in events producing neutrinos. 

The predicted flux at the SK and ND280 detectors, 
including systematic errors, is shown in Fig. [5j Here we 
describe the methods for weighting the flux and evaluat- 
ing uncertainties based on proton beam measurements, 
hadron interaction data, alignment measurements, horn 
current and field measurements, and the beam direction 
measurement from the 1NGRID detector. More details 
of the flux calculation are described in Ref. |4"3"1. 



Weighting and systematic error evaluation 
methods 
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FIG. 5: The T2K flux prediction at SK (a) and ND280 

(a) for neutrinos and antineutrinos with systematic 
error bars. The flux above E v — 10 GeV is not shown; 
the flux is simulated up to E v — 30 GeV. 



are calculated. A large number of parameters sets, N 
(typically 500 or more), are used to calculate the frac- 
tional covariance using: 



To tune the flux model and study its uncertainties, ad- 
justments are made by weighting events based on kine- 
matics of the hadron interactions or the primary proton. 
The sensitivities to nuisance parameters that arise from 
such uncertainties as the hadron production model, pro- 
ton beam profile, or horn currents, are evaluated by their 
effect on the predicted neutrino spectrum. 

We use one of two approaches for each uncertainty 
source, depending on whether the uncertainty source has 
correlations that need to be treated. For error sources de- 
scribed by a number of correlated underlying parameters, 
we use weighting methods when possible. The nuisance 
parameters are sampled according to their covariance and 
the corresponding flux predictions for the k samples, (j> k , 



= 1 (^? om -^)(^° m -^) f , 

l 3 AT ' i Anora J.nom V / 

fe=l Vi V J 

Here cf>2° m is the nominal flux prediction and i specifies a 
neutrino energy bin, flavor and detector at which the flux 
is evaluated. We evaluate hadron interaction and proton 
beam profile uncertainties with this method. 

For systematic variations that cannot be treated by 
weighting simulated events, such as misalignment of 
beam line elements or changes to the horn currents, we 
produce new simulated event samples with ±lcr varia- 
tions of the nuisance parameters and calculate the frac- 



tional covariance matrix: 

i (<£r m 



nom 



j^nom ^norn 
nom 



j^nom ^pnom 



(4) 



and </>~ are the flux prediction for +ler and — ltr vari- 
ations of the nuisance parameter. We evaluate horn and 
target alignment and horn current and field uncertainties 
with this method. 

The total fractional flux covariance matrix is the sum 
of fractional flux covariance matrices calculated for each 
source of uncertainty. For the fits to data described in 
Sections | VII| and |IX[ variations of the flux prediction are 
modeled with parameters bi that scale the normalization 
of the flux in bins of neutrino energy and flavor at a 
given detector. The covariance matrix of the bi, (Vb)ij, 
is simply the total fractional flux covariance matrix de- 
scribed here. Since the bi are separated for the near and 
far detectors, their covariances account for the correla- 
tions between the flux predictions at the two detectors. 
The covariances can therefore be used directly in simul- 
taneous fits of near and far detector data or to calculate 
the uncertainty on the ratio of flux spectra at the two 
detectors. 

The following sections describe each source of flux sys- 
tematic uncertainty. 



TABLE II: Summary of measured proton beam profile 
parameters and uncertainties at the collimator for a 
typical run period : mean position (X,Y) and angle 
(X',Y'), width (a), emittance (e), and Twiss 
parameter (a). 



X Profile 

Parameter Central Value Error 


Y Profile 
Central Value Error 


X, Y (mm) 0.00 0.35 
X',Y' (mrad) 0.03 0.07 
a (mm) 4.03 0.14 
e (n mm mrad) 4.94 0.54 
a 0.33 0.08 


-0.37 0.38 
0.07 0.28 
4.22 0.12 
6.02 3.42 
0.34 0.41 



TABLE III: Differential hadron production data 
relevant for the T2K neutrino flux predictions. 



Experiment 



Beam Mom. Target Particles 



NA61/SHINE g3 
Eichten et al. [5T] 
Allaby et al. [52] 
BNL-E910 [53] 



31 GeV/c C 
24 GeV/c Be, 
19.2 GeV/c Be, 
6.4-17.5 GeV/c Be 



Al. 
Al. 



P, 7T 
P, 7T d 



The variations correspond to shifts in the off-axis angle 
of ~ 0.35 mrad, or shifts in the off-axis spectrum peak of 
~ 10 MeV. 



B. Proton beam monitoring and simulation 

We simulate the proton beam according to the pro- 
ton orbit and optics parameters measured by the proton 
beam position and profile monitors, and the number of 
protons measured by the intensity monitors. These mon- 
itors are described elsewhere [3TJ[50]. We measure proton 
beam properties for each run period by reconstructing the 
beam profile at the upstream end of the collimator that 
sits before the T2K target for each beam spill. The sum 
of profiles for each beam spill, weighted by the number of 
protons, gives the proton beam profile that we input to 
the flux simulation. Table [IT] summarizes the measured 
mean position, angle, emittance, Twiss a parameter and 
width of the proton beam at the collimator, and their 
uncertainties for a typical run period. The largest con- 
tributions to the flux uncertainty from the proton beam 
simulation arise from the alignment uncertainties of the 
beam monitors. 

The effect of the proton beam profile uncertainty on 
the flux is studied by varying the parameters in Table [TT] 
within their uncertainties while accounting for the pa- 
rameter correlations. The uncertainties on Y and Y' are 
dominant and are studied on a simulated "wide beam" 
flux sample that has a profile in the y — y' (proton ver- 
tical position and angle) plane that covers the measured 
uncertainties. The wide beam sample is weighted for vari- 
ations of Y and Y' and the effect on the flux is studied. 



C. Hadron production data, weighting and 
uncertainties 

The pion and kaon differential production measure- 
ments we use to weight the T2K flux predictions are 
summarized in Table HTT1 

We weight charged meson differential production mul- 
tiplicities to the NA61/SHINE tt+ /tt" [34] and K+ [45] 
thin target production data, which covers most of phase 
space relevant for the off-axis flux. We use additional 
kaon differential production data from Eichten et al. |51] 
and Allaby et al. [52] to weight K + multiplicities in the 
phase space not covered by the NA61/SHINE measure- 
ments, and for K~ multiplicities. To estimate the un- 
certainty of pion production by secondary protons, we 
use differential pion production data from the BNL-E910 
experiment [53) that were collected in interactions with 
proton beam energies less than the T2K primary proton 
beam energy. 

We use measurements of the inelastic cross sections for 
proton, pion, and kaon beams with carbon and aluminum 
targets [54-64. to weight based on particle interaction 
and absorption rates in the flux prediction. In particular, 
NA61/SHINE measures the inclusive "production" cross 
section of 31 GeV/c protons on carbon: cr pro d = 229.3 ± 
9.2 mb The production cross section is defined as: 

Cprod — &inel @qe- (5) 
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Here, a qe is the quasi-elastic scattering cross section, i.e. 
scattering off of individual bound nucleons that breaks 
up or excites the nucleus, but docs not produce addi- 
tional hadrons. The inclusive production cross section 
is used in the weighting of the flux prediction, and the 
quasi-elastic cross section is subtracted from measure- 
ments where necessary. 

We apply hadron interaction-based weights to simu- 
lated events in two steps. The multiplicity of pions and 
kaons produced in interactions of nucleons on the target 
nuclei is defined as: 



dp Oprod dp 



(6) 



Here p and are the momentum and angle relative to 
the incident particle of the produced particle in the lab 
frame. We apply multiplicity weights that are the ratio 
of the measured and simulated differential multiplicities: 



w(j>,e)= [ % ip ' e)]data 



I dp 



(7) 



MC 



We adjust the interaction rates of protons, charged pi- 
ons and charged kaons as well, with weights that account 
for attenuation in the target: 



W 



°prod -x{tr' j-a vrod )p 



fprod 



(8) 



Here p is the number density of nuclear targets in the ma- 
terial, a VTO d is the original inclusive production cross sec- 
tion in the simulation, cr' prod is the inclusive production 
cross section to which the simulation is being weighted, 
and x is the distance traversed by the particle through 
the material. The total weight is the product of weights 
from all materials through which the particle propagates. 

For pion and kaon production in secondary nucleon 
interactions, or in the phase space covered by the alter- 
native kaon production data sets, we converted weights 
to an xf ~Pt dependence, where pt is the transverse mo- 
mentum of the produced particle and Xf is the Feynman 
x 165 defined as: 



xf = Pl/Pt, 



(9) 



Here pl is the longitudinal momentum of the produced 
particle in the center of mass frame, and p ma x is the 
maximum momentum the produced particle can have. 
We apply the xp—pr dependent weights after converting 
simulated hadron interactions to the xf ~Pt basis. This 
method assumes that the pion and kaon multiplicities 
expressed in the xp — Pt basis are independent of the 
collision center of mass energy. 

The effect of the hadron interaction weighting on the 
SK and v e flux are shown as the ratios of weighted to 
nominal flux in Fig. [6] The weighting of pion multiplici- 
ties is a 10% effect at low energy, while the weighting of 
kaon multiplicities affects the flux by as much as 30% in 



the high energy tail. The large weighting effect for kaons 
is due to the underestimation of kaon production above 
kaon momenta of 3 GeV/c in the simulation. The effect 
of the inclusive production cross section weighting on the 
flux prediction is less than 4% for all energies. 
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FIG. 6: Ratio of the hadron interaction weighted flux to 
the nominal flux for (a), v e (b) flux predictions at 

SK. The effects of the pion production, kaon production 
and inclusive production cross section weighting are 
shown separately and in total. 

The uncertainties on the hadron multiplicity measure- 
ments contribute to the total uncertainty on the flux. 
Typical NA61/SHINE tt* data points have ~ 7% system- 
atic error, corresponding to a maximum uncertainty of 
6% on the flux. In addition, we evaluate uncertainties on 
the Xf scaling assumption (less than 3%), and regions of 
the pion phase space not covered by data (less than 2%). 
The dominant source of uncertainty on the kaon produc- 
tion is the statistical uncertainty on the NA61/SHINE 
measurements. 

The uncertainties on the inclusive production cross sec- 
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tion measurements reflect the discrepancies that are seen 
between different measurements at similar incident par- 
ticle energies. These discrepancies are similar in size to 
a qe and may arise from ambiguities in the actual quan- 
tity being measured by each experiment. We apply an 
uncertainty equal to the a qe component to the inclusive 
production cross section measurements (typically larger 
than the individual measurement errors), and the un- 
certainty propagated to the flux is less than 8% for all 
energies. 

We apply an additional uncertainty to the produc- 
tion of secondary nucleons, for which no adjustments are 
made in the current flux prediction The uncertainty is 
based on the discrepancy between the FLUKA model- 
ing of secondary nucleon production and measurements 
by Eichten et. al. [51] and Allaby et. al. [52 . The un- 
certainty propagated to the flux is less than 10% for all 
energies. 

The neutrino energy-dependent hadron interaction un- 
certainties on the SK and v e flux predictions are sum- 
marized in Fig. [7J and represent the dominant source of 
uncertainty on the flux prediction. 



D. Horn and target alignment and uncertainties 

The horns are aligned relative to the primary beam line 
with uncertainties of 0.3 mm in the transverse x direc- 
tion and 1.0 mm in the transverse y direction and beam 
direction. The precision of the horn angular alignment is 
0.2 mrad. After installation in the first horn, both ends 
of the target were surveyed, and the target was found 
to be tilted from its intended orientation by 1.3 mrad. 
We have not included this misalignment in the nominal 
flux calculation, but the effect is simulated and included 
as an uncertainty. We also simulate linear and angular 
displacements of the horns within their alignment uncer- 
tainties and evaluate the effect on the flux. The total 
alignment uncertainty on the flux is less than 3% near 
the flux peak. 
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FIG. 7: The fractional hadron interaction errors on 
(a), v e (b) flux predictions at SK. 



of the horn current, and the effect on the flux is 2% near 
the peak. 



E. Horn current, field and uncertainties 



We assume a 1/r dependence of the magnetic field in 
the flux simulation. The validity of this assumption is 
confirmed by measuring the horn field using a Hall probe. 
The maximum deviation from the calculated values is 2% 
for the first horn and less than 1% for the second and 
third horns. Inside the inner conductor of a spare first 
horn, we observe an anomalous field transverse to the 
horn axis with a maximum strength of 0.065 T. Flux sim- 
ulations including the anomalous field show deviations 
from the nominal flux of up to 4%, but only for energies 
greater than 1 GeV. 

The absolute horn current measurement uncertainty is 
2% and arises from the uncertainty in the horn current 
monitoring. We simulate the flux with ±5 kA variations 



F. Off-axis angle constraint from INGRID 

The muon monitor indirectly measures the neutrino 
beam direction by detecting the muons from meson de- 
cays, while the INGRID on-axis neutrino detector di- 
rectly measures the neutrino beam direction. The dom- 
inant source of uncertainty on the beam direction con- 
straint is the systematic uncertainty on the INGRID 
beam profile measurement, corresponding to a 0.35 mrad 
uncertainty. We evaluate the effect on the flux when the 
SK or ND280 off-axis detectors are shifted in the simu- 
lation by 0.35 mrad. 
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G. Summary of flux model and uncertainties 

The T2K flux predictions at the ND280 and SK detec- 
tors have been described and are shown in Fig. [5} We 
use the flux predictions as inputs to calculate event rates 
at both the ND280 and SK detectors. To evaluate the 
flux related uncertainties on the event rate predictions, 
we evaluate the fractional uncertainties on the flux pre- 
diction in bins of energy for each neutrino flavor. The 
bin edges are: 

• v M : 0.0, 0.4, 0.5, 0.6, 0.7, 1.0, 1.5, 2.5, 3.5, 5.0, 7.0, 
30.0 GeV 

• 2/ M : 0.0, 1.5, 30.0 GeV 

• v e : 0.0, 0.5, 0.7, 0.8, 1.5, 2.5, 4.0, 30.0 GeV 

• u e : 0.0, 2.5, 30.0 GeV 

We choose coarse binning for the antineutrino fluxes since 
they make a negligible contribution for the event samples 
described in this paper. The neutrino flux has finer bins 
around the oscillation maximum and coarser bins where 
the flux prediction uncertainties are strongly correlated. 

The uncertainties on the ND280 SK and SK v e 
flux predictions are shown in Fig. [S] and the correlations 
are shown in Fig. [9] The correlations shown are evaluated 
for the binning described above. The ND280 and SK 
flux predictions have large correlations, indicating the 
interaction rate at the near detector can constrain the 
unoscillated interaction rate at the far detector. The 
SK v P flux is also correlated with the ND280 flux, since 



the Vn and v e both originate from the 7r 



I 1 ' 



decay 



chain or kaon decays. This correlation also allows us to 
constrain the expected intrinsic v e rate at the far detector 
by measuring interactions at the near detector. 



V. NEUTRINO INTERACTION MODEL 

We input the predicted neutrino flux at the ND280 
and SK detectors to the NEUT jM] neutrino interaction 
generator to simulate neutrino interactions in the detec- 
tors. Fig. [TU] illustrates the neutrino-nucleon scattering 
processes modeled by NEUT at the T2K beam energies. 
The dominant interaction at the T2K beam peak energy 
is charged current quasi-elastic scattering (CCQE): 



V( 



+ N -+£ + N', 



(10) 



where £ is the corresponding charged lepton associated 
with the neutrino's flavor (electron or muon), and N and 
N' are the initial and final state nucleons. Above the pion 
production threshold, single pion production contributes 
to charged current interactions (CCl7r): 



+ N -> £ + N' + ■ 



and neutral current interactions (NCl7r): 
v + N ^ v + N' + iT. 
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FIG. 8: The fractional uncertainties on the ND280 f M , 
SK and SK v e flux evaluated for the binning used in 
this analysis. This binning is coarser than the binning 
shown in Fig. [7] and includes the correlations between 
merged bins. 



In the high energy tail of the T2K flux, multi-pion and 
deep inelastic scattering (DIS) processes become domi- 
nant. 



A. NEUT simulation models 

CCQE interactions in NEUT arc simulated using the 
model of Llewellyn Smith [ST], with nuclear effects de- 
scribed by the relativistic Fermi gas model of Smith and 
Moniz [HHl HH]- Dipole forms for the vector and axial- 
vector form factors in the Llewellyn Smith model are 
used, with characteristic masses My = 0.84 GeV and 
Ma = 1.21 GeV respectively in the default simulation. 
The Fermi momentum pp is set to 217MeV/c for carbon 
and 225 MeV/c for oxygen, and the binding energy is set 
to 25 MeV for carbon and 27 MeV for oxygen. 

NEUT simulates the production of pions via the excita- 
tion of hadronic resonances using the model of Rein and 
Sehgal [TO] , The simulation includes 18 resonances below 
2 GeV, along with interference terms. In the energy range 
relevant for T2K, resonance production is dominated by 
the A(1232). For 20% of the As produced within a nu- 
cleus, NEUT also simulates pion-less A decay, in which 
the A de-excites in the nuclear medium without the emis- 
sion of pions. NEUT includes the production of pions in 
coherent scattering of the neutrino on the target nucleus 
based on the Rein and Sehgal model. 

Multi-pion and DIS interactions in NEUT are simu- 
lated using the GRV98 parton distribution functions [71] . 
Where the invariant mass of the outgoing hadronic sys- 
tem (W) is in the range 1.3 < W < 2.0 GeV/c 2 , a cus- 
tom program is used |72) . and only pion multiplicities of 
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FIG. 9: The correlations of the flux uncertainties in the 6j bins for the ND280 Vn and SK Vn and v e fluxes. The axes 
are the bins in neutrino energy for each flavor/detector combination and are proportional to the neutrino energy up 

to 10 GeV. 



greater than one are considered to avoid double counting 
with the Rein and Sehgal model. For W > 2.0 GeV/c 2 
PYTHIA / JETSET [73j is used. Corrections to the small 
Q 2 region developed by Bodek and Yang are applied [71] . 

NEUT uses a cascade model to simulate the interac- 
tions of hadrons as they propagate through the nucleus. 
For pions with momentum below 500 MeV/c, the method 
of Salcedo et at |75j is used. Above pion momentum of 
500 MeV/c the scattering cross sections are modeled us- 
ing measurements of tt scattering on free protons |76j . 

Additional details on the NEUT simulation can be 
found elsewhere 1511. 



are chosen and their central values and uncertainties are 
set to cover the systematic uncertainties on the interac- 
tion models derived from comparisons of NEUT to exter- 
nal data or alternative models. They are a combination 
of free parameters in the NEUT model and ad-hoc empir- 
ical parameters. The parameter values and uncertainties 
are further constrained by the fit to neutrino data from 
the T2K ND280 detector, as described in Section \VU\ 
To tune the NEUT model parameters and evaluate the 
effect of neutrino interaction uncertainties, adjustments 
are carried out by applying weights to simulated NEUT 
event samples from T2K or external experiments, such 
as MiniBooNE. 



B. Methods for varying the NEUT model 



Uncertainties in modeling neutrino interactions are a 
significant contribution to the overall systematic uncer- 
tainty in the v e appearance analysis reported in this pa- 
per. In the rest of this section, we describe these uncer- 
tainties with nuisance parameters that vary the NEUT 
interaction models. The parameters, listed in Table IV 



C. NEUT model comparisons to external data and 
tuning 

A detailed description of the NEUT model tuning 
using external data comparisons can be found in Ap- 
pendix [XJ Here we provide a brief summary. 
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TABLE IV: The parameters used to vary the NEUT cross section model and a brief description of each parameter. 



CCQE Cross Section 

~M T ^ E The mass parameter in the axial dipole form factor for quasi-elastic interactions 

x? E The normalization of the quasi-elastic cross section for E v < 1.5 GeV 

x qe rQ ic norma jj za ti on f tjj e quasi-elastic cross section for 1.5 < E v < 3.5 GeV 

x® E The normalization of the quasi-elastic cross section for E v > 3.5 GeV 
Nuclear Model for CCQE Interactions (separate parameters for interactions on O and C) 

xsf Smoothly changes from a relativistic Fermi gas nuclear model to a spectral function model 

PF The Fermi surface momentum in the relativistic Fermi gas model 

Resonant Pion Production Cross Section 

M^ 3 The mass parameter in the axial dipole form factor for resonant pion production interactions 

x ccin rpjjg normalization of the CC resonant pion production cross section for E v < 2.5 GeV 

x ccin rp^g norm alization of the CC resonant pion production cross section for E v > 2.5 GeV 

x NCln The normalization of the NCl7r° cross section 

xittE„ Varies the energy dependence of the In cross section for better agreement with MiniBooNE data 
W e g Varies the distribution of Ntt invariant mass in resonant production 

x-K-iess Varies the fraction of A resonances that decay or are absorbed without producing a pion 
Other 

x UUcoh - The normalization of CC coherent pion production 

x NCcoh. rp^g norma lization of NC coherent pion production 

x NCother norma li za tion of NC interactions other than NCl7r° production 

xccother Varies the CC multi-7r cross section normalization, with a larger effect at lower energy 

xfsi Parameters that vary the microscopic pion scattering cross sections used in the FSI model 

x Vc / v ^ Varies the ratio of the CC v e and cross sections 
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FIG. 10: The NEUT interaction cross section per 
nucleon on 16 O with a breakdown by interaction 
process. The "NC Other" curve includes neutral 
current coherent pion production, resonant charged 
pion production, multi-pion production and deep 
inelastic scattering. The predicted flux spectrum at 
SK with no oscillations is shown for comparison. 



1. FSI model tuning and uncertainty 

The NEUT FSI model includes parameters which al- 
ter the microscopic pion interaction probabilities in the 
nuclear medium. The central values of these parameters 
and their uncertainties are determined from fits to pion 
scattering data [77H75] . We consider variations of the 
FSI parameters within the uncertainties from the fit of 
the pion scattering data, and evaluate the uncertainties 
on the predicted event rates for ND280 and SK selections. 



2. CCQE model uncertainty 

The most detailed measurement of CCQE scattering 
on light nuclei in the region of 1 GeV neutrino energy has 
been made by MiniBooNE, which has produced double- 
differential cross sections in the muon kinetic energy and 
angle, (Ij,, cos 6> M ) jSD]. We compare the agreement of 
NEUT to the MiniBooNE CCQE data in addition to our 
own nea r detector measurement of CCQE events (Sec- 
tion 



VI I since the MiniBooNE detector has 4tt accep- 



tance, providing a kinematic acceptance of the leptons 
that more closely matches the SK acceptance for the se- 
lection described in Section IVIIII This is illustrated in 
Fig. 11 which compares the predicted true Q 2 distribu- 



tions for CCQE events in the ND280 CCQE selection, 
the MiniBooNE CCQE selection, and the SK selection 
for v e appearance candidates. 

In order to allow the ND280 data to constrain the 
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FIG. 11: The predicted Q 2 distributions for CCQE 
interactions in the ND280 CCQE selection, the 
MiniBooNE CCQE selection, and the SK v e appearance 
selection. 



CCQE model, we use the difference of the NEUT nom- 
inal value and the best-fit value from fit to MiniBooNE 
data to set the uncertainty on M® E , o m qe = 0.43 GeV. 
We also set the uncertainty on the low energy CCQE 
normalization, xf E , to the size of the MiniBooNE flux 
uncertainty, 11%. The results of the MiniBooNE fit are 
discussed in more detail in Appendix |A} 

To allow for the discrepancy in CCQE cross section at 
0(1) GeV measured by MiniBooNE and at 0(10) GeV 
measured by NOMAD [ST], we employ independent 
CCQE normalization factors for (1.5 < E v < 3.5) GeV 
{x2 E ) and E v > 3.5 GeV (xf E ), each with a prior un- 
certainty of 30% and a nominal value of unity. 

Alternate explanations have been proposed to recon- 
cile the MiniBooNE data with a M$ E 1.0 GeV de- 
rived from electron scattering and NOMAD data [82"H5rj] . 
These models typically modify the cross section either by 
enhancing the transverse component of the cross section, 
or by adding an additional multi-nucleon process to the 
existing cross section, where the neutrino interacts on a 
correlated pair of nucleons. Future improvements to the 
NEUT generator may include a full implementation of al- 
ternate CCQE models. However, these models would also 
require modifications to the kinematics of the exiting nu- 
cleons, but no consensus has been reached yet in the field 
as to how the nucleons should be treated. We consider 
two possible effects of alternate CCQE models on the 
v e appearance analysis. First, the effect in Q 2 for these 
models is often similar to increasing M® E and [55] shows 
that other improvements to the CCQE cross section can 
be represented by an experiment-specific M® E (effective) , 
so the increase to the overall cross section from these 
models is approximately covered by the uncertainty on 
M® E • Second, a multi-nucleon process would appear 
as a CCQE-like interaction in the SK detector, but the 
relationship between the neutrino energy and the lep- 



TABLE V: Parameters used in the single pion fits, and 
their best-fit values and uncertainties. The la value of 

the penalty term is shown for parameters which are 
penalized in the fit. Where parameters are defined in a 

manner consistent with the T2K data fits, the same 
parameter name is used. 



Nominal value Penalty best-fit Error 



M^ s (GeV) 


1.21 




1.16 


0.10 




1 




0.48 


0.14 


XCCother 





0.40 


0.36 


0.39 


Normalizations: 










x CCcoh 


1 




0.66 


0.70 




1 




1.63 


0.32 


x NCcoh 


1 


0.30 


0.96 


0.30 




1 




1.19 


0.36 


NC Itt* 


1 


0.30 


0.98 


0.30 


NC multi-pion/DIS 


1 


0.30 


0.99 


0.30 



ton kinematics is different than for quasi-elastic scatters, 
which may affect the determination of oscillation param- 
eters. Other processes also appear CCQE-like and have a 
different relationship between lepton kinematics and neu- 
trino energy, such as non-QE events with no pions in the 
final state (pion- less A decay) . The uncertainty on these 
events indirectly accounts for the effect of multi-nucleon 
models as these events affect the extracted oscillation pa- 
rameters in a way similar to how multi-nucleon models 
would. 



3. Single pion production model tuning and uncertainty 

Measurements of single pion production cross sections 
on light nuclei in the T2K energy range have been made 
by MiniBooNE [57H55] . and K2K, which used a 1000 ton 
water Chercnkov detector [5D]. We perform a joint fit to 
the MiniBooNE measurements of charged current single 
7r + production (CCl7r + ), charged current single 7r° pro- 
duction (CCl7r°) and neutral current single ir° produc- 
tion (NCl7r°). As shown in Appendix|Aj we compare the 
NEUT best-fit derived from the MiniBooNE single pion 
data with the K2K measurement, which is of particular 
interest since it is the same nuclear target as SK. 

The parameters listed in Table [V] are varied in the fit 
to the MiniBooNE single pion data and their best-fit val- 
ues and uncertainties are listed. The parameters include 
M^ ES , the axial mass in the Rein and Sehgal model, the 
empirical parameter, W c g, discussed in the next para- 
graph, and parameters that vary the normalization of 
various interaction modes. Contributions to the samples 
from CC multi-pion/DIS (xecother) interactions, NC co- 
herent interactions, NCl^ interactions and NC multi- 
pion/DIS interactions are relatively small, so the Mini- 
BooNE samples have little power to constrain the asso- 
ciated parameters which are discussed in Section |V C 4| 
Penalty terms for these parameters are applied using the 
prior uncertainties listed in Table [Vj 
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The W c g parameter alters the single pion differential 
cross section as a function of pion-nucleon invariant mass 
W, providing a means to change the shape of the NEUT 
prediction for NCl7r° da/dp^o differential cross section. 
Uncertainties in the NCl7r° pion momentum distribution 
enter into the v e appearance analysis, as the momentum 
and angular distributions of v e candidates from NCl7r° 
interactions depend on the kinematic distribution of the 
7T°. The NEUT predicted p n o spectrum, shown in the 
bottom plot of Fig. [12] is broader than the observed Mini- 
B00NE data. A decrease to the W c g parameter results in 
a more sharply-peaked p n o spectrum, and achieves agree- 
ment between the NEUT prediction and the measured 
cross section; W e g does not alter the total cross section. 
Future changes to the NEUT model that may eliminate 
the need for W c s include refinements of the treatment of 
formation time effects, which have been shown to affect 
the pion momentum distribution |91) . or modifications 
to the contribution of higher order resonances relative to 
A(1232). 



P^CGeV/c) 



The fitted data and NEUT model are shown in Fig. [12] 
We propagate the fitted parameter values for Mj±°, 
x cc\tt an( j x nc\-k an£ } tfigrr correlated uncertainties to 
the fits of ND280 and SK data. The remaining param- 
eters from the fit to MiniBooNE data are marginalized. 
We evaluate additional uncertainties on these parame- 
ters by re-running the fit to MiniBooNE data with vari- 
ations of the FSI model and pion-less A decay turned 
off. The deviations of the fitted parameter values due to 
these FSI or pion-less A decay variations are applied as 
parameter uncertainties, increasing the uncertainties on 
M% ES , xf cl7T and x NCl7r ° to 0.11 GeV, 0.43 and 0.43 
respectively. The fitted W c g parameter value is not ap- 
plied to the T2K predictions, but the difference between 
the nominal value of W e g and the best-fit value from the 
MiniBooNE data fit is treated as an uncertainty. 

An additional uncertainty in the energy-dependent 
pion production cross section is considered since we ob- 
serve a discrepancy between the fitted NEUT model and 
the MiniBooNE CC1tt + data, as shown in Fig. [l3} We 
introduce a parameter x\ke„ that represents the energy- 
dependent tuning which brings the NEUT prediction into 
agreement with the MiniBooNE data. Uncertainties on 
the ND280 and SK predictions include the difference be- 
tween the resonant pion production with and without 
this energy-dependent tuning. 



FIG. 12: Differential cross sections for CC17T+ Q 2 (top), 
CGItt Q 2 (middle) and NCItt p n o (bottom) used in 
the single-pion fits to MiniBooNE data, and the NEUT 
nominal and best-fit predictions. The MiniBooNE data 
point errors are statistical+systematic. 



The fits to MiniBooNE data constrain the normaliza- 
tion of CC17T resonant production below 2.5 GeV. Above 
2.5 GeV, we apply a separate normalization uncertainty 
of 40% on the parameter x^ cilx . This uncertainty cov- 
ers the maximum difference between MiniBooNE CCl7r + 
data and NEUT at E u ps 2 GeV and is conservative given 
the CC inclusive cross section measurements [92] made 
at higher energies. 
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FIG. 13: The CCl7r + cross section as a function of 
energy as measured by MiniBooNE, with the NEUT 
nominal and best-fit models. The treatment in the 
analysis of the disagreement between the best-fit NEUT 
and data is discussed in the text. 



4- Other interaction channels 

We evaluate the uncertainty on CC coherent pion pro- 
duction based on measurements by K2K [53] and Sci- 
BooNE [94] which place upper limits on the CC coher- 
ent production that are significantly less than the Rein 
and Sehgal model prediction. Since no clear CC coher- 
ent signal has been observed at 0(1) GeV , we apply a 
100% normalization uncertainty to the NEUT CC coher- 
ent pion production (x CCcoh ). 

SciBooNE's measurement of the NC coherent pion 
cross section at 0(1) GeV 95J is in good agreement 
with the Rein and Sehgal model prediction; the uncer- 
tainty on this channel is set to 30% based on the Sci- 
BooNE measurement and is represented by a normaliza- 



greater than 4 GeV, these modes dominate the inclusive 
cross section and are constrained by measurements of 
the inclusive cross section [95] with «10% uncertainties. 
At lower energies the constraint from the inclusive cross 
section measurements is weaker since other interactions 
modes are significant. Hence, we apply an uncertainty 
that is 10% at high energies and increases to 40% near 
the threshold for multi-pion production. The model is 
adjusted by applying a weight: 



w 



1 



XCCother 



(13) 



The parameter xccother is allowed to vary around a nom- 
inal value of with a prior uncertainty of 0.4 GeV. 



D. Nuclear model uncertainties 

NEUT models nuclei with a relativistic Fermi gas 
model (RFG) using a Fermi momentum pp from elec- 
tron scattering data [99]. We evaluate the uncertainty 
on the CCQE cross section for variations of pf within its 
uncertainty of 30 MeV/c. This uncertainty covers the un- 
certainty from the electron scattering data and has been 
inflated to cover possible discrepancies in the CCQE cross 
section at low Q 2 . The uncertainty is applied indepen- 
dently for interactions on carbon and oxygen targets. 

We also consider alternatives to the RFG model of the 
nuclei by making comparisons to a spectral function nu- 
clear model implemented in the NuWro neutrino inter- 
action generator |100j . The discrepancy in CCQE inter- 
actions models with the RFG and spectral function are 
assigned as uncertainty and represented by the parame- 
ter xsf which smoothly varies the predicted lepton kine- 
matics between the RFG (xsf = 0) and spectral function 
(xsf = 1) models. We apply the uncertainties for the nu- 
clear model independently for carbon and oxygen cross 
sections. 



E. 



cross section uncertainty 



tion parameter, x 



NCcoh 



Differences between and v e 



n NCother 



We define a single parameter also considered, as the CC v 



that varies the normalization of the NC reso- 
nant 7r , NC elastic and NC multi-pion/DIS/other reso- 
nant modes. The uncertainty on this normalization pa- 
rameter is set to 30%. As there is little NC resonant 7r 
data, the uncertainty on the NC resonant processes 
is se t to be the same size as the agreement shown in Sec- 
tion VC3 for the NC resonant l7r° cross section (30%). 



The NC multi-pion and DIS model was tuned to agree 
with the CC/NC data using the NEUT predicted CC DIS 
cross section; the uncertainties on this phenomenological 
model are set to cover the size of the uncertainties of the 
CC/NC data [96l [97] (30%). 

The CC multi-pion/DIS interactions contribute to the 

At 



ND280 samples discussed in Section VI 



energies 



in the cross section are 
M sample at ND280 is used 
to infer the CC v e rate at the far detector. The spec- 
tral function uncertainty is calculated separately for 
and v e as well as target material. In addition, an overall 
3% uncertainty on the ratio of and v e CC ncutrino- 
nucleon cross sections (x Ve / v ) is included, based on cal- 
culations [101J over T2K's energy range. 



F. Summary of the neutrino cross section model, 
tuning and uncertainties 

The cross section model parameters values and uncer- 
tainties are listed in Table |VI[ These priors are used as 
inputs to fits to the T2K ND280 and SK data sets, and 
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TABLE VI: The parameters used to vary the NEUT 
cross section model along with the values used in the 
ND280 fit (input value) and uncertainties prior to the 
ND280 and SK data fits. 



Parameter Input Value Uncertainty 



(GeV) 


1.21 


0.43 




1.00 


0.11 


x 2 


1.00 


0.30 


x 3 


1.00 


0.30 


X.SF 


0.0 


1.0 


p F ( 12 C) (MeV/c) 


217 


30 


p F { 16 0) (MeV/c) 


225 


30 


M% ES (GeV) 


1.16 


0.11 




1.63 


0.43 




1.00 


0.40 


x NCln° 


1.19 


0.43 


X\irE v 


off 


on 


w cS 


1.0 


0.51 


X-^ — less 


0.2 


0.2 


x CCcoh. 


1.0 


1.0 


x NCcoh. 


1.0 


0.3 


x NCother 


1.0 


0.3 


xccother (GeV) 


0.0 


0.4 




1.0 


0.03 



A. ND280 simulation 

The ND280 detector response is modeled with a 
GEANT4-based [1021 EGS] Monte Carlo (MC) simula- 
tion, using the neutrino flux described in Section |IV| and 
the NEUT simulation. The MC predictions presented 
in this section are not calculated with the cross section 
parameter tuning described in Table |Vj Neutrino inter- 
actions are generated up to 30 GeV for all flavors from 
the unoscillated flux prediction, with a time distribution 
matching the beam bunch structure. The ND280 sub- 
detectors and magnet are represented with a detailed 
geometrical model. To properly represent the neutrino 
flux across a wider range of off-axis angles, a separate 
simulation is run to model neutrino interactions in the 
concrete and sand which surround ND280. The scintilla- 
tor detectors, including the FGD, use custom models of 
the scintillator photon yield, photon propagation includ- 
ing reflections and attenuation, and electronics response 
and noise. The gaseous TPC detector simulation includes 
the gas ionization, transverse and longitudinal diffusion 
of the electrons, propagation of the electrons to the read- 
out plane through the magnetic and electric field, and 
a parametrization of the electronics response. Further 
details of the simulation of the individual detectors of 
ND280 can be found in Refs [2UIH]. 



include the results of the MiniBooNE single pion model 
fit. For parameters related to the nuclear modeling, such 
as xsf, Pf( 12 G) and pf ( 16 0), we apply separate uncor- 
rected parameters for the modeling of interactions on 
12 C and 16 0. Hence, the fit to ND280 data does not con- 
strain the nuclear modeling parameters used when mod- 
eling interactions at SK. Of the remaining parameters, we 
treat them as correlated for ND280 and SK if they are 
strongly constrained by ND280 data. These parameters 
include the CCQE cross section parameters, M? E , xf E , 
and the CC17T cross section parameters, M^ E , xf ci7r . 
To preserve the correlations between NC and CC param- 
eters from the fit to MiniBooNE single pion data, x NCl7T 
is also propagated. All other parameters are not well con- 
strained by the ND280 data and are applied separately 
for ND280 and SK interaction modeling. 



B. 



candidate selection 



VI. 



ND280 NEUTRINO DATA 



We select samples of CC interactions in the ND280 
detector, which are fitted to constrain the flux and cross 
section models, as described in Section VII CC in- 



teraction candidates are divided into two selections, one 
enhanced in CCQE-like events, and the second consist- 
ing of all other CC interactions, which we refer to as the 
CCnonQE-like selection. While the v e flux and interac- 
tion models are constrained by the CC data, we also 
select a sample enhanced in CC v e interactions to directly 
verify the modeling of the intrinsic v e rate. 



We select CC interactions by identifying the muons 
from u^N — > fi~ X interactions, which may be accom- 
panied by hadronic activity X from the same vertex. 
Of all negatively charged tracks, we identify the high- 
est momentum track in each event that originates in 
the upstream FGD (FGD1) and enters the middle TPC 
(TPC2) as the /i~ candidate. The negatively charged 
track is identified using curvature and must start inside 
the FGD1 fiducial volume (FV) that begins 48 mm in- 
ward from the edges of FGD1 in x and y and 21 mm in- 
ward from the upstream FGD1 edge in z. In this analysis 
we use only selected tracks with a vertex in FGD1, since it 
provides a homogeneous target for neutrino interactions. 
To reduce the contribution from neutrino interactions up- 
stream of the FGD1 FV, any tracks which pass through 
both the upstream TPC (TPC1) and FGD1 are rejected. 
This also has the consequence of vetoing backward-going 
particles from the CC interaction vertex, so the resulting 
selection is predominantly forward-going fj," . 

The fi~ candidate track energy loss is required to be 
consistent with a muon. The identification of particles 
(PID) is based on a truncated mean of measurements of 
energy loss in the TPC gas, from which a discriminator 
function is calculated for different particle hypotheses. 
We apply the discriminator to select muon candidates 
and reject electron and proton tracks. The TPC PID 
and TPC performance are described in more detail else- 
where HU1. 



18 



Events passing the previously described cuts comprise 
the CC-inclusive sample, and the number of selected 
events and the MC predictions are listed in Table |VII| 
These data correspond to 2.66 x 10 20 POT. The predic- 
tions include a correction for the event pile-up that is not 
directly modeled by the Monte Carlo simulation of the 
detector. The pile-up correction takes into account the 
presence of neutrino interactions in the same beam bunch 
originating in the sand and material surrounding the de- 
tector. The size of this correction ranges between 0.5% 
and 1% for the different run periods. Of CC i/„ interac- 
tions in the FGD1 FV, 47.6% are accepted by the CC- 
inclusive selection, and the resulting selection is 88.1% 
pure. The largest inefficiency of the CC-inclusive selec- 
tion is from high angle particles which do not traverse a 
sufficient distance through the TPC to pass the selection 
criteria. 

We divide the CC-inclusive events into two mutu- 
ally exclusive samples sensitive to different neutrino in- 
teraction types: CCQE-like and CCnonQE-like. As the 
CCQE neutrino interaction component typically has one 
muon and no pions in the final state, we separate the 
two samples by requiring the following for the CCQE- 
like events: 

• Only one muon-like track in the final state 

• No additional tracks which pass through both 
FGD1 and TPC2. 

• No electrons from muon decay at rest in FGD1 
(Michel electron) 

A Michel electron will typically correspond to a stopped 
or low energy pion that decays to a muon which stops 
in FGD1, and is identified by looking for a time-delayed 
series of hits in FGD1. The Michel electron tagging ef- 
ficiency is 59%. Events in the CC-inclusive selection 
which do not pass the CCQE-like selection comprise 
the CCnonQE-like sample. Example event displays for 



ND280 events are shown in Fig. 14 

The numbers of selected events in the data and nomi- 
nal prediction for the CCQE-like and CCnonQE-like se- 
lections are shown in Table rVTTTl Table HXl shows the com- 
position of the CC, CCQE-like and CCnonQE-like selec- 
tions according to the generated neutrino interaction cat- 
egories in the Monte Carlo. The CCQE-like sample con- 
tains 40.0% of all CCQE interactions in the FGD1 FV, 
and CCQE interactions comprise 69.5% of the CCQE-like 
sample. 

Fig. [15] shows the distributions of events binned in the 
muon momentum (p^) and cosine of the angle between 
the muon direction and the z-axis (cos (9^) for both data 
and the prediction. In addition, we check the stability of 
the neutrino interaction rate with a Kolmogorov-Smirnov 
(KS) test of the accumulated data and find p-values of 
0.20, 0.12, and 0.79 for the CC-inclusive, CCQE-like and 
CCnonQE-like samples, respectively. 

Both CCQE-like and CCnonQE-like samples provide 
useful constraints on the neutrino flux and neutrino in- 




FIG. 14: Event displays of example ND280 CCQE-like 
(a) and CCnonQE-like (b) selected events. 

TABLE VII: Number of data and predicted events for 
the ND280 CC-inclusive selection criteria. 





Data 


MC 


Good negative track in FV 
Upstream TPC veto 
H PID 


21503 
21479 
11055 


21939 
21906 
11498 



teraction models. The CCQE-like sample includes the 
dominant neutrino interaction process at the T2K beam 
peak energy (CCQE) and the CCnonQE-like sample is 
sensitive to the high energy tail of the neutrino flux, 
where relatively few CCQE interactions occur. The fit of 
the flux and cross section models to these data, further 
described in Section VII uses two-dimensional p^ and 
cos# M distributions for the CCQE-like and CCnonQE- 
like samples. We use a total of 20 bins per each sample, 
where p^ is split into 5 bins and cos 9^ is split into 4 
bins. The data and the expected number of events for 
this binning are shown in Table |X| 



C. Detector Response Modeling Uncertainties 

We consider systematic uncertainties on the modeling 
of the detection efficiency and reconstruction of events 
which affect: 

• the overall efficiency for selecting CC interactions 
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FIG. 15: Muon momentum for the CC-inclusive (a), CCQE-like (c), and CCnonQE-like (e) samples. Cosine of the 
muon angle for the CC-inclusive (b), CCQE-like (d), and CCnonQE-like (f) samples. The errors on the data points 

are the statistical errors. 



• the reconstructed track properties cos# M ) 

• the sample (either CCQE-like or CCnonQE-like) in 
which the event is placed 

We estimate uncertainties from each category with a va- 
riety of control samples that include beam data, cosmic 
events and simulated events. 



The uncertainty on the efficiency for selecting CC 
interactions is propagated from uncertainties on: the 
data quality criteria applied to the tracks, track recon- 
struction and matching efficiencies, PID, and determina- 
tion of the track curvature. We also consider the uncer- 
tainty on the detector mass. 

The systematic uncertainty on the track momentum 
determination is from uncertainties on the magnetic field 
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TABLE VIII: Number of data and predicted events for 
the ND280 CCQE-like and CCnonQE-like selection 
criteria, after the CC-inclusive selection has been 
applied. 





CCQE-like 


CCnonQE-like 




Data 


MC 


Data 


MC 


TPC-FGD track 
Michel electron 


6238 
5841 


6685 
6244 


4817 
5214 


4813 
5254 



TABLE IX: Breakdown of the three ND280 CC samples 
by true interaction type as predicted by the MC 
simulation. 



Event type 


CC-inclusive 


CCQE-like 


CCnonQE-like 


CCQE 


44.4 


69.5 


14.7 


CC resonant In 


21.4 


14.5 


29.6 


CC coherent 7r 


2.8 


1.7 


4.0 


All other CC 


18.8 


3.7 


36.8 


NC 


3.0 


1.3 


5.1 


P„ 


0.7 


0.2 


1.2 


out of FV 


7.8 


7.6 


8.0 


sand interactions 


1.1 


1.6 


0.5 



absolute value and field non-uniformity. Small imperfec- 
tions in the magnetic and electric fields can affect the 
path of the drift electrons, causing a distorted image of 
the track and a possible bias in the reconstructed momen- 
tum. The size of these distortions is constrained from 
laser calibration data and MC simulations using mag- 
netic field measurements made prior to detector installa- 
tion. The overall momentum scale is determined from the 
magnitude of the magnetic field component transverse to 
the beam direction, B Xl which is inferred from the mea- 
sured magnetic coil current. The momentum resolution is 
determined in data from studies of tracks which traverse 
multiple TPCs; the individual momentum calculated for 
a single TPC can be compared to the momentum deter- 
mined by nearby TPCs to infer the momentum resolution 
in data and MC simulation. 

The primary causes of event migration between the 
CCQE-like and CCnonQE-like samples are external 
backgrounds or interactions of pions. External back- 
grounds in the samples are due to three sources: cos- 
mic rays, neutrino interactions upstream in the surround- 
ing sand and concrete, and neutrino interactions in the 
ND280 detector outside the FV (out of FV). Interactions 
from the sand or concrete contribute to the number of 
tracks in the selected event, which can change a CCQE- 
like event to a CCnonQE-like event. Interactions that 
occur outside of the FGD1 FV are about 7.6% of the 
total selected CC-inclusive sample. Sources include neu- 
trino interactions in FGD1 outside of the FV, or particles 
produced in interactions downstream of FGD1 that travel 
backwards to stop in the FGD1 FV. Pion absorption and 
charge exchange interactions in the FGD material can 
also reduce the probability that a charged pion produces 
a track in TPC2, affecting the identification of an event 



as CCQE-like or CCnonQE-like. The uncertainty on the 
GEANT4 modeling of pion inelastic scattering is evalu- 
ated by comparing the GEANT4 model to pion scattering 
data. 

For each source of systematic uncertainty, we generate 
a 40 x 40 covariance matrix with entries for each pair 
of (p^jCOS^) bins. These matrices represent the frac- 
tional uncertainty on the predicted numbers of events 
in each (p^fiosO^) bin for each error source. The bin- 
ning used is the same as shown in Table |Xj where the 
first 20 bins correspond to the CCQE-like sample and 
the second 20 correspond to the CCnonQE-like sample. 
The total covariance matrix Vd is generated by linearly 
summing the covariance matrices for each of the system- 
atic uncertainties. Fig. 16 shows the bin-to-bin correla- 



tions from the covariance matrix, which displays the fea- 
ture of anti-correlations between bins in the CCQE-like 
and CCnonQE-like samples arising from systematic error 
sources, such as the pion absorption uncertainty, that mi- 
grate simulated events between samples. Table [Xl] sum- 
marizes the range of uncertainties across the (p„,cos#„) 
bins and the uncertainty on the total number of events. 
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FIG. 16: The bin-to-bin correlation matrix from the 
systematic covariance matrix for the selected sample 
at ND280. The bins are ordered by increasing cos 9^ in 
groups of increasing muon momentum (p$ to p^) for the 
two selections. 



D. Intrinsic v e candidate selection 

We also select a sample of CC v e interactions to check 
the consistency of the predicted and measured intrinsic 
v e rates. The CC i/„ selections described earlier provide 
the strongest constraint on the expected intrinsic v e rate, 
through the significant correlation of the flux to the 
v e flux. However, a CC v e selection at the near detector 
provides a direct and independent measurement of the 
intrinsic v e rate. 
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TABLE X: Data (MC) p M and cos # M events split in bins as used by the fit described in Section VII at ND280. 

CCQE-like sample 





Pll (MeV/c) 




0-400 


400-500 


500-700 


700-900 


>900 


-1 < cos 6^ < 0.84 
0.84 < cos6» M < 0.90 
0.90 < cos0 M < 0.94 
0.94 < cos0 M < 1.0 


854 (807.7) 
110 (107.2) 
62 (69.1) 
92 (95.4) 


620 (655.6) 
110 (116.3) 
67 (74.0) 
73 (85.4) 


768 (821.2) 
235 (270.6) 
142 (179.0) 
184 (216.5) 


222 (255.0) 
133 (153.5) 
90 (121.4) 
160 (174.8) 


222 (233.0) 
159 (194.7) 
228 (274.6) 
1310 (1339.0) 


CCnonQE-like sample 




p„ (MeV/c 


) 




0-400 


400-500 


500-700 


700-900 


>900 


-1 < cos^ < 0.84 
0.84 < cos0 M < 0.90 
0.90 < cos0 M < 0.94 
0.94 < cos(9 M < 1.00 


560 (517.9) 
83 (80.3) 
46 (58.6) 
75 (76.6) 


262 (272.2) 
42 (35.8) 
37 (33.8) 
33 (43.2) 


418 (400.3) 
83 (80.2) 
60 (63.1) 
91 (93.4) 


256 (237.8) 
86 (74.8) 
39 (56.4) 
85 (87.2) 


475 (515.0) 
365 (389.8) 
462 (442.6) 
1656 (1694.7) 



TABLE XI: Minimum and maximum fractional errors 
among all the (p M ,cos# M ) bins, including the largest 
error sources. The last column shows the fractional 
error on the total number of events, taking into account 
the correlations between the (p^jCOS^) bins. 



Systematic error 



Error Size (%) 





Minimum and 
maximum fractional 
error 


Total fractional 
error 


B-Field Distortions 


0.3 - 6.9 


0.3 


Momentum Scale 


0.1 - 2.1 


0.1 


Out of FV 


- 8.9 


1.6 


Pion Interactions 


0.5 - 4.7 


0.5 


All Others 


1.2 - 3.4 


0.4 


Total 


2.1 - 9.7 


2.5 



We select CC v e interactions by applying the same cri- 
teria as described in Section [VI B[ except that the energy 
loss for the highest momentum negatively charged parti- 
cle is required to be consistent with an electron instead of 
a muon, and interactions in FGD2 are used to increase 
the sample size. For electrons of momenta relevant to 
T2K, the energy loss is 30-40% larger than for muons 
at the same momenta, and so electrons and muons are 
well separated since the TPC energy loss resolution is 
less than 8% [3D]. In addition, for tracks which reach 
the downstream ECAL, we use the information from the 
ECAL to remove events in which the lepton candidate is 
consistent with a muon. 

The total number of selected events in the electron 
candidate sample is 927. The signal efficiency for select- 
ing CC v e interactions in the FGD1 and FGD2 FV is 
31.9% with an overall 23.7% purity. For higher momenta 
the relative purity of the selection increases (42.1% for 
Pe > 300 MeV/c). 

The majority of selected v e are from kaon decay (80%). 
The dominant background events (78% of the total back- 
ground) are low energy electrons produced by photon 
conversion in the FGDs, called the 7 background. The 
photons come from 7r° decays, where the 7r°s are gener- 



ated in interactions either in the FGD or in the ma- 
terial which surrounds the FGD. A total of 7% of the re- 
maining background events are misidentificd muons com- 
ing from interactions. The probability for a muon to 
be misidentified as an electron is estimated to be less than 
1% across most of the relevant momentum range. This 
probability is determined using a highly pure (>99%) 
sample of muons from neutrino-sand interactions. Fi- 
nally, background not belonging to the two previous cat- 
egories is mainly due to protons and pions produced in 
NC and CC interactions in the FGD. Fig. 17 (a) shows 



the momentum distribution of the highest momentum 
track with negative charge for each event in the selected 
electron candidate sample. 

We estimate the uncertainties on the detector response 
modeling for the electron candidate sample in the same 
manner as described in Section [VI C[ with additional un- 
certainties considered for the FGD2 interactions in the 
selection, and for electron-PID selection. The total de- 
tector response systematic uncertainty on the electron 
candidate sample is 5.7%, with the TPC PID (3.8%) un- 
certainty as the largest. 

The rate of intrinsic v e interactions is determined with 
a likelihood fit to reconstructed momenta of electron can- 
didate events. To constrain the large background from 
photons, a control sample of positron (positive charge, 
electron PID tracks) candidates is used. Fig.[l7](b) shows 
the momentum distribution of candidate positrons. The 
sample is composed of positrons at lower energies and 
protons at higher energies. We simultaneously fit the 
electron and positron candidate samples to determine 
the photon background and v e signal rate normalizations. 
The misidentified muon background component is fixed 
according to the estimate from the pure muon control 
sample, and other smaller background sources are fixed 
according to the nominal predictions. Neutrino flux, neu- 
trino cross section, and detector response uncertainties 
are included in the likelihood fit. 

The inferred rate of CC v e events in data from the 
likelihood fit normalized by the prediction is 0.88 ± 
0.10(stat.) ± Q.15(syst.). The measured v e rate at the 
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near detector is consistent with the prediction within 
systematic uncertainties. The neutrino flux and cross 
section systematic uncertainties are the dominant con- 
tributions to the total systematic error on the v e rate. 
In Section I VIII we show the v P rate after the flux and 



cross section parameters are tuned by the fit to the CC 
v„ data. 
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FIG. 17: (a) Momentum distribution of the highest 
momentum track with negative charge for each event in 
the electron candidate sample at ND280. The inset 
shows the region with momentum > 300 MeV/c. (b) 
Momentum distribution of the highest momentum track 
with positive charge for each event of the positron 
candidate control sample. The "Other Backgrounds" 
component is mainly due to protons and pions from NC 
and CC interactions in the FGD. The energy loss of 

positrons and protons (pions) is similar at 
p wlOOO MeV/c (200 MeV/c), resulting in the presence 
of these particles in the positron candidate sample. 



VII. ND280 CONSTRAINT ON THE 
NEUTRINO FLUX AND CROSS SECTION 
MODELS 

The rate of neutrino interactions measured at the 
ND280 detector has power to constrain the neutrino flux 
and interaction models used to predict the v e candidate 
event rate at the SK detector. The predicted SK i/ e sig- 
nal and neutral current background both depend directly 
on the unoscillated flux, while the intrinsic v e back- 
ground depends on the v e flux. As shown in Fig. |9j both 
the SK v ^ and v e flux predictions are correlated to the 
ND280 Un flux prediction through the underlying data 
and assumptions applied in the flux calculation. Both 
the SK v e signal and intrinsic v e background also depend 
on the charged current interaction model. Hence, a fit to 
the CC-inclusive events from ND280 can constrain flux 
and cross section nuisance parameters relevant to the SK 
prediction. 

We fit the near detector CCQE-like and CCnonQE- 
like i/fj, data to determine tuned values of the and v e 
flux parameters and cross section model parameters, de- 
scribed in Sections |IV| and [V] respectively. The fit in- 
cludes the marginalization of nuisance parameters de- 
scribing uncertainties in the simulation of the detector 
response and parameters describing parts of the neutrino 
interaction model that are not correlated for ND280 and 
SK selections. The tuned parameters are then applied 
to predict the v & signal and background interactions at 
SK. The fit also incorporates constraints on the flux and 
cross section models determined independently from the 
ND280 data constraint to properly propagate all con- 
straints to the SK event rate predictions. 



A. ND280 likelihood 

The fit maximizes a likelihood that includes the binned 
likelihood of the ND280 data and the prior constraints on 
the flux model, the interaction model, and the detector 
response model: 



C ND (b,x,d\Nf) = Tr flux (b)ir xsec (x)ir det (d)x 



n 



[N?(b,x,db] N ?e- N W> 3 & 



(14) 



^ flux 

(b), TT xsec (x), iTdet(d) are multivariate normal distri- 
butions that are functions of the flux (b), neutrino cross 
section (x) and detector response (d) nuisance param- 
eters. These functions encode the prior constraints on 
the nuisance parameters and depend on the nominal pa- 
rameter values and the parameter errors or covariancc 
matrices described in previous sections. The likelihood 
includes the product of the Poisson probabilities for the 
Nbms = 40 bins of the CCQE-like and CCnonQE-like se- 
lections. For each bin the predicted number of events, 
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Nf(b,x,d), is evaluated based on the values of the nui- 
sance parameters, and compared to the measurement, 
Nf. To obtain fit results that more closely follow a % 2 
distribution |104j . we define the likelihood ratio: 



£ ND (b,x,d\N?) 



C ND (b°,x°,dP,N? = Nf\N?) 



(15) 



Here the denominator is the likelihood evaluated with Nf 
set equal to Nf and the nuisance parameters set to their 
nominal values: b° = 1, x°, dr = 1; both b and dhave 
nominal values of 1. The quantity that is minimized is 

— 21n(£ rat .j ): 

— 2\n(£. rat io) — 

N bins 

2 N?(b,x,d)-Nf + Nfln[Nf/N?(b,x,d)} 

i=l 
N b N b 

+EE( 1 *-w 1 )i J -(i-y 
i=i j=i 

N x 

+EE(^-^(^ 1 m^ -^) 

i=l j=l 

N binB N bins 

+ E E (i-^WMT'kifi-rfi) 



(16) 



In 



\V d (b°,xP)\ 



The predicted number of events in each observable 
bin, N p (b,x,d) depends on the value of the b, x = 



0? 



zfnorm ztresp' 



, and d nuisance parameters: 



Nf = d, 



E u bins Int. modes 

^ ^ b 



3 



k 



(17) 



are the nominal Monte Carlo templates that 
predict the event rate for bins in the observables, i, true 
neutrino energy, j, and neutrino interaction modes, k. 
The b parameters multiply the flux prediction in bins of 
true neutrino energy. The detector response parameters, 
d, multiply the expected number of events in each observ- 



able (p^ 



,cos( 



bin. The x are included in the prediction 



in one of two ways. The x% orm are cross section param- 
eters that multiply the neutrino cross section normaliza- 
tion for a given true neutrino energy bin and one of the k 
interaction modes. We model the effect of the remaining 
cross section parameters, x resp , with pre-calculated re- 
sponse functions, Wi,j : k{x resp ), that have a value of 1 for 
the nominal parameter settings and can have a non-linear 
dependence on the cross section parameters. 



The remaining terms in Eq. 16 correspond to the prior 
constraints on the flux, cross section and detector re- 
sponse models discussed in earlier sections. V& is the prior 
fractional covariance matrix, corresponding to Figures [8] 



and[9| The covariances of flux predictions at ND280 and 
SK are included so that the fit to ND280 data can con- 
strain the SK flux parameters. The prior covariance ma- 
trix for the neutrino interaction parameters, V Xl is diag- 
onal for most parameters with entries corresponding to 
the errors listed in Table IVll Correlations are included for 
the parameters constrained by the fit to MiniBooNE sin- 
gle pion data. The Vd fractional covariance matrix, with 



correlations shown in Fig. 16 incorporates the simulated 
detector efficiency and reconstruction uncertainties, final 
state interaction errors and Monte Carlo statistical er- 
rors. The final term in the likelihood is present since 
the Monte Carlo statistical errors included in Vd depend 
on the b and x parameters through the weights applied 
to the simulated events. Since Vd is not constant, the 
determinant from the multivariate normal distribution, 
Kdet(d), cannot be dropped from the — 21n(£ rat j ). 



B. Parameter propagation and marginalization 

This fitting method extrapolates the ND280 constraint 
on the neutrino flux and interaction model to the far de- 
tector prediction through the simultaneous variation of 
ND280 and SK flux parameters, and the constraint on 
the common interaction model parameters. After the 
— 21n(£ rat i ) is minimized, we apply a subset of the fit- 
ted parameter values to the calculation of the expected 
v e candidate rate at SK. The subset of parameters which 
are substantially constrained by the ND280 data sets and 
are also relevant to the event rate prediction at SK are 
listed in Table |XII| Since they are not used to calcu- 
late the predicted event rates at SK, the flux parameters 
for ND280, nuclear model-dependent cross section pa- 
rameters, and detector response systematic parameters 
are marginalized by integrating out their dependence in 
— 2hi(C ra tio) under the assumption of a quadratic depen- 
dence near the minimum. The remaining cross section 
parameters do not affect the SK event prediction sub- 
stantially and these are also marginalized. 



C. ND280 fit results 

The resulting (p M ,cos# M ) distributions from the fit to 
the ND280 samples are shown in Fig. 18 We evaluate the 
post-fit agreement between model and data by generat- 
ing 2000 pseudo-experiments with statistical and system- 
atic variations, and fitting them to obtain the minimum 
— 21n(£ rat i ) value for each pseudo-experiment. The dis- 
tribution of these values resembles a \ 2 distribution of 41 
degrees of freedom. Thus the value [— 2\n(C ra tio)]min — 
29.7 from the fit to data indicates that the data are con- 
sistent with the prediction within the prior uncertainties 
assigned for the neutrino flux model, neutrino interaction 
model, and detector response model. 

The propagated neutrino flux and cross section param- 
eter values prior to and after the fit are listed in Ta- 
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FIG. 18: The fitted p M , cos 6> M bins from the ND280 CCQE-like (left) and CCnonQE-like (right) samples. All values 
in the plot are divided by the shown bin width. The > 900 MeV/c bin additionally contains the overflow bin, and 
is normalized by a bin width of 1100 MeV/c. The prediction prior to the fit uses the modifications to the NEUT 
model parameters derived from fits of the MiniBooNE single pion data. 



ble |XII| The fit decreases the flux prediction near the 
spectrum peak to improve agreement with the data. In 
addition to modifying the parameter central values and 
uncertainties, the fit also sets the correlations between 
parameters. Prior to the fit, the flux and cross section 
model parameters have no correlation, but the fit intro- 
duces anti-correlations, as shown in Fig. 19 The anti- 



correlations arise because the event rate depends on the 
product of the neutrino flux and the neutrino interaction 
cross section. 



D. Consistency checks with ND280 data 

We perform a consistency check of the fit results by 
applying the fitted parameters to the ND280 MC simu- 
lation and investigating the data and predicted rates in 
more finely binned kinematic distributions. Fig. |20| shows 
the level of agreement in the muon momentum and angle 
distributions of the CCQE and CCnonQE-like samples 
before and after the fit constraint to the flux and cross 
section models are applied. The fitted flux and cross sec- 
tion models show improved agreement with the data. 

We also apply the fitted flux and cross section pa- 
rameters to the ND280 CC v e simulation. Adopting 



the same analysis as in Section VI D while using the 



fitted cross section and flux parameters, we measure 
the ratio of inferred to predicted CC u e rate to be 
0.91 ± 0.10(stat.) ± 0.1Q(syst.). The CC v e rate remains 
consistent within the reduced systematic uncertainties 
after tuning. 

To check the modeling of NC7r° production, we mea- 
sure the rate of single ir° with the POD detector using 
a data set corresponding to 8.55 x 10 19 POT. The ra- 
tio of the measured to the predicted rate is found to be 
0.84 ± 0.16(stat.) ± 0.18(syst.). When normalized to the 
corresponding ratio from the ND280 CC selection, we 
measure a ratio of 0.81 ± 0.15(stat.) ± 0.14(syst.), in- 
dicating that the predicted rate is consistent with the 
measured rate within errors. 



VIII. SK ELECTRON NEUTRINO SELECTION 

For a non-zero value of #13, we expect an oscillated 
Vn — > v e flux with a peak oscillation probability near 
600 MeV at the SK detector. To detect the oscillated v e , 
we select SK events with a single electron-like Chercnkov 
light ring, providing a sample that is enhanced in CCQE 
v e interactions. Additional cuts are applied to reduce the 
backgrounds from intrinsic v e contamination of the beam 
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TABLE XII: Prior and fitted values and uncertainties of 
the propagated neutrino flux and cross section model 
parameters. 



Parameter 


Prior Value Fitted Value 




0.0-0.4 


GeV 


1 


00 ±0 


12 





98 ± 0.09 




0.4-0.5 


GeV 


1 


00 ±0 


13 





99 ±0.10 




0.5-0.6 


GeV 


1 


00 ±0 


12 





98 ± 0.09 


v n 


0.6-0.7 


GeV 


1 


00 ±0 


13 





93 ± 0.08 


v v 


0.7-1.0 


GeV 


1 


00 ±0 


14 





84 ± 0.08 


v„ 


1.0-1.5 


GeV 


1 


00 ±0 


12 





86 ± 0.08 


v v 


1.5-2.5 


GeV 


1 


00 ±0 


10 





91 ± 0.08 


v» 


2.5-3.5 


GeV 


1 


00 ±0 


09 





95 ±0.07 




3.5-5.0 


GeV 


1 


00 ±0 


11 





98 ± 0.08 


v» 


5.0-7.0 


GeV 


1 


00 ±0 


15 





99 ±0.11 




> 7.0 GeV 


1 


00 ±0 


19 


1 


01 ±0.15 



0.0-1.5 GeV 

> 1.5 GeV 
0.0-0.5 GeV 
0.5-0.7 GeV 
0.7-0.8 GeV 
0.8-1.5 GeV 
1.5-2.5 GeV 
1.5-4.0 GeV 

> 4.0 GeV 



1.00 ±0.12 
1.00 ±0.11 
1.00 ±0.13 
1.00 ±0.12 
1.00 ±0.14 
1.00 ±0.10 
1.00 ±0.10 
1.00 ±0.12 
1.00 ±0.17 



0.95 ±0.10 
0.95 ±0.10 
0.96 ±0.10 
0.96 ±0.10 
0.96 ±0.11 
0.94 ±0.08 
0.97 ±0.08 
0.99 ±0.09 
1.01 ±0.13 



i> e 0.0-2.5 GeV 
u e > 2.5 GeV 



1.00 ±0.19 
1.00 ±0.14 



0.97 ±0.18 
1.02 ±0.11 



(GeV) 
M% ES (GeV) 
xf E 
x? ci * 

N Cln® 



1.21 ±0.45 
1.16 ±0.11 
1.00 ±0.11 
1.63 ±0.43 
1.19 ±0.43 



1.33 ±0.20 
1.15 ±0.10 
0.96 ±0.09 
1.61 ±0.29 
1.19 ±0.40 



and 7T° background. The selection is described here. 



A. The SK detector simulation 

We simulate the predicted event distributions at the far 
detector with the neutrino flux prediction up to 30 GeV, 
the NEUT cross section model, and a GEANT3-based 
detector simulation. The v e signal events from — > v e 
oscillation are produced using the predicted spectrum 
without oscillations, and the v e cross section; oscillations 
probabilities are applied after the simulation. Addition- 
ally, the intrinsic v^, V^, v e and V e components of the 
beam are generated from the intrinsic flux predictions 
without oscillations. 

SKDETSIM, a GEANT3-based simulation of the SK 
detector, simulates the propagation of particles produced 
in the neutrino interactions in the SK detector. We use 
the GCALOR physics package to simulate hadronic in- 
teractions in water since it successfully reproduces pion 
interaction data around 1 GeV. For pions with momen- 
tum below 500 MeV, however, we use custom routines 
based on the cascade model used by NEUT to simulate 
interactions of final state hadrons. SKDETSIM models 
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FIG. 19: The neutrino flux and cross section parameter 
correlations before (a) and after (b) the fit to the ND280 
data. The flux parameters are ordered by increasing 
energy, and the cross section parameter ordering is: 



71/ 



QE 



CCQE low energy normalization, CC17T 



low energy normalization and NC17T normalization. 
The antineutrino flux parameters are included in the 
analysis but not in the plot for clarity; the effect of 
those parameters on the analysis is small. 



the propagation of light in water, considering absorption, 
Rayleigh scattering, and Mie scattering as possible in- 
teractions. The parameters employed in the models of 
these processes have been tuned using a number of laser 
calibration sources [41] . Example event displays for sim- 
ulated SK events are shown in Fig. [21] 

As a final step, we scale the predicted events accord- 
ing to the constrained flux and cross section models from 
the fit to the ND280 CC-inclusive data, and according 
to the oscillation probability. The three-neutrino oscil- 
lation probability, including matter effects, is calculated 
for each event with the parameter values shown in Ta- 
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FIG. 20: Comparisons of the p M (left column) and cos# M (right column) distributions for CCQE-like selected 
events in (a) and (b) and CCnonQE-like selected events in (c) and (d). The solid line represents the NEUT 
nominal prediction and the hatched region represents the post-fit MC prediction. The dots are the data events. 
Below each graph, the data/MC ratio is shown for both the NEUT nominal prediction (empty triangle) and post-fit 
MC prediction (full triangle). The error on the points is the statistical error on the data. 



TABLE XIII: Default neutrino oscillation parameters 
and earth matter density used for the MC prediction. 



Parameter 



Value 



|Am| 2 | 
sin 2 0i2 
sin 2 2023 
Scp 

Mass hierarchy 
v travel length 
Earth matter density 



eV 2 



eV 2 



7.6x10 
2.4x10 
0.32 
1.0 


Normal 
295 km 
2.6 g/cm 3 



ble XIII| unless otherwise noted. 



B. Neutrino event selection 

We select fully contained (FC) events, which deposit 
all of their Cherenkov light inside the SK inner detector 
(ID), by applying the following selection criteria. First, 
any photomultiplicr tubes (PMTs) which register suffi- 
cient charge, a "PMT hit", in the outer detector (OD) 
are associated with other nearby PMT hits to form clus- 
ters. Events with greater than 15 hits in the highest 
charge OD cluster are rejected. Second, most of the low 
energy (LE) events are removed by requiring that the to- 
tal charge from ID PMT hits in a 300 ns time window 
must be above 200 photoelectrons (p.e.), corresponding 
to visible energy, E vis , above 20 MeV. Visible energy is 
defined as the energy of an electromagnetic shower that 
produces the observed amount of Cherenkov light. In or- 
der to remove events caused by radioactivity very close 
to the PMT, a third cut removes events in which a single 
ID PMT hit has more than half of the total charge in a 
300 ns time window. 

The final FC selection cut rejects events with ID pho- 
tomultipliers which produced light because of a discharge 



27 




FIG. 21: Example event displays for the SK simulation 
of a) Up CCQE (single well-defined ring from the 
muon), b) v e CCQE (single diffuse ring from the 
electron) and c) za, NCl7r° interactions (two diffuse 
rings from the tt° — > 77 decay). The images show the 

detected light pattern at the ID wall, with the 
cylindrical SK detector shown as a flat projection. The 
color indicates the amount of charge detected by the 
PMT, with purple dots corresponding to the least 
amount of charge, and red the most. 



around the dynode, called "flasher" events. The cut iden- 
tifies flasher events from their timing distribution, which 
is much broader than neutrino events, and from a re- 
peating pattern of light in the detector. However, neu- 
trino events arc sometimes misidentified as flasher events 
when the neutrino interaction vertex is close to the ID 
wall. There have been a total of 8 events that have been 
rejected by the flasher cut during all run periods. From 
event time information and visual inspections, it is clear 
that all eight events are induced by beam neutrino inter- 
actions. The predicted number of rejected beam events 
from this cut is 3.71 events; the probability to observe 8 
or more events when 3.71 are expected is 3.6%. All eight 
events have vertices close to the ID wall, and would be 
rejected by the fiducial cut. 

We define the quantity ATo, which is the timing of the 
event relative to the leading edge of the spill, accounting 
for the travel time of the neutrino from production to de- 
tection. Fig. [22] shows the AT distribution of all FC, OD 
and LE events within ±500 fis of the beam arrival time; 
the spill duration is about 5 fis. A clear peak at ATo = 
is seen for the FC sample. We observe five FC events 
outside of the 5 /Lis spill window. The expected number 
of such out-of-time FC events, mainly low energy events 
and atmospheric neutrino events, is estimated to be 3.3 
from data collected when the beam is not present. Fig. [23] 
shows the ATo distribution of FC events within the spill 
window. We correct the ATo of each event to account 
for the position of the neutrino interaction vertex and 
the photon propagation time from the interaction vertex 
to the PMTs. The far detector event timing clearly ex- 
hibits the eight bunch beam timing structure. The eight 
dotted vertical lines in the figure represent the 8 bunch 
centers at intervals of 581ns from a fit to the observed 
FC event timing. The RMS value of the residual time 
distribution between each FC event and the closest of 
the fitted bunch center times is about 25 ns. 

We require the AT for selected FC events to be be- 
tween —0.2 /lis to 10 fis. We observe 240 such in-time fully 
contained events. We extract a fully contained sample 
within the fiducial volume (FCFV) by further requiring 
i?vis to be above 30 MeV and the reconstructed vertex be 
2 m away from the ID wall. We observe 174 such FCFV 
events, while the expected accidental contamination from 
events unrelated to the beam, mostly atmospheric neu- 
trino interactions, is calculated to be 0.005 events. 

CC v e interactions (v e N — > e~X) are identified in SK 
by detecting a single, electron-like ring; at the energy of 
the T2K neutrino beam, most of the produced particles 
other than the electron are below Cherenkov threshold 
or do not exit the nucleus. The main backgrounds are 
intrinsic v e contamination in the beam and NC interac- 
tions with a misidentified ir°. The analysis relies on the 
well-established reconstruction techniques developed for 
other data samples in SK [105J. The single, electron- 
like ring selection criteria are unchanged from our pre- 
vious measurement of electron neutrino appearance |21) . 
and were determined from MC studies before data-taking 
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FIG. 22: AT distribution of all FC, OD and LE events 

observed in the ±500 fjs T2K windows. The OD 
histogram is stacked on the FC histogram, and the LE 
histogram is stacked on the OD and FC histograms. 
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FIG. 23: ATo distribution of FC events zoomed in on 
the spill time observed during T2K Run 1+2 and Run 
3. The eight dotted vertical lines represent the 581ns 
interval bunch center position fitted to the observed FC 
event times. 



commenced. We select CC v e candidate events which 
satisfy the following criteria: 

(1) The event is fully contained in the ID and the re- 
constructed vertex is within the fiducial volume 
(FCFV) 

(2) There is only one reconstructed ring 

(3) The ring is electron-like 

(4) The visible energy, E v i s , is greater than 100 MeV 



(5) There is no Michel electron 

(6) The event's invariant mass is not consistent with a 
7T° mass 

(7) The reconstructed neutrino energy, £^ cc , is less 
than 1250 MeV 

The -Evis cut removes low energy NC interactions and 
electrons from the decay of unseen muons and pions, 
such as cosmic muons outside the beam time window 
or muons below Cherenkov threshold. A Michel electron 
is an electron from muon decay which is identified by 
looking for a time-delayed ID-PMT hit peak after the 
primary neutrino interaction. In order to reduce NC w° 
events, we utilize a special fitter which reconstructs each 
event with a two photon ring hypothesis. It searches 
for the direction and energy of the second ring which 
maximizes the likelihood based on the light pattern of 
the event [106 . Fig. 24 shows the invariant mass Afi nv 



distribution of the two photon rings for the data and 
simulation. As shown in the figure, the NC background 
component peaks around the ir° invariant mass, hence 
events with M- mv > 105 MeV/c 2 are cut. Finally, the en- 
ergy of the parent neutrino is computed assuming CCQE 
kinematics and neglecting Fermi motion as follows: 



El 



ml - (m n - E b f 



+ 2(m„ - E b )E e 



2(m„ - E b - E e + p e cos 6 e ) 



(18) 



where m p is the proton mass, m n the neutron mass, and 
E b = 27 MeV is the binding energy of a nucleon inside a 
16 nucleus. E e , p e , and 9 e are the reconstructed elec- 
tron energy, momentum, and angle with respect to the 
beam direction, respectively. We select E™ c < 1250 MeV 
since the signal at high energy is expected to be small for 
the atmospheric mass splitting, and the intrinsic v e back- 



ground is dominant in this region, as shown in Fig. 25 



The numbers of observed events after each selection 
criterion, and the MC predictions for sin 2 2fl i3 — 0.1 and 



sin 26>i3 = 0, are shown in Tables 



XIV 



and 



XV 



respec- 

Eleven events remain in the data after all v e ap- 

We esti- 



tively. 

pearance signal selection criteria are applied, 
mate the v e appearance signal efficiency in the SK FV to 
be 62%, while the rejection rates for CC +V^ : intrinsic 
CC v e +V e , and NC are > 99.9%, 80%, and 99%, respec- 
tively. More than half of the remaining background is due 
to intrinsic CC v e interactions (57% for sin 2 29^ = 0.1). 
The fraction of CCQE events in the CC v e signal and 
background are 80% and 65%, respectively. NC interac- 
tions constitute 41% of the total surviving background, 
80% of which are due to ir° mesons and 6% of which 
originate from NC single photon (A — > Nj) production. 

Additional checks of the eleven data events are per- 
formed. From visual inspection, it appears that all events 
have only a single, electron-like Cherenkov ring. A KS 
test of the observed number of v e candidate events as 
a function of accumulated POT is compatible with the 
normalized eve nt r ate b eing constant (p-value = 0.48) as 
shown in Fig. 26 Fig. 27 shows the (x,y) and (r 2 ,z) 
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TABLE XIV: Event reduction for the v e appearance search at the far detector. After each selection criterion is 
applied, the numbers of observed and MC expected events of CC v^, intrinsic CC v e , NC, and the CC v e signal, are 
given. All MC samples include three- neutrino oscillations for sin 2 2#i3 = 0.1, 5c p = 0, and normal mass hierarchy. 





Data MC total CC 


CC v e 


NC 


CC f M Ve 


(0) interaction in FV 


n/a, 


311.4 


158.3 


8.3 


131.6 


13.2 


(1) fully contained in FV 


174 


180.5 


119.6 


8.0 


40.2 


12.7 


(2) single ring 


88 


95.7 


68.4 


5.1 


11.4 


10.8 


(3) e-like 


22 


26.4 


2.7 


5.0 


8.0 


10.7 


(4) E vis > 100 MeV 


21 


24.1 


1.8 


5.0 


6.9 


10.4 


(5) no delayed electron 


16 


19.3 


0.3 


4.0 


5.9 


9.1 


(6) not 7r°-like 


11 


13.0 


0.09 


2.8 


1.6 


8.:> 


(7) ET < 1250 MeV 


11 


11.2 


0.06 


1.7 


1.2 


8.2 



TABLE XV: Same as Table XIV but with MC prediction for sin 2 29 13 = 0. 





Data MC total CC 


CC v e 


NC 


CC f M -> v e 


(0) interaction in FV 


n/a 


299.0 


158.5 


8.6 


131.6 


0.3 


(1) fully contained in FV 


174 


168.5 


119.8 


8.2 


40.2 


0.3 


(2) single ring 


88 


85.4 


68.5 


5.3 


11.4 


0.2 


(3) e-like 


22 


16.1 


2.7 


5.2 


8.0 


0.2 


(4) E vis > 100 MeV 


21 


14.1 


1.8 


5.2 


6.9 


0.2 


(5) no delayed electron 


16 


10.6 


0.3 


4.2 


5.9 


0.2 


(6) not 7r°-like 


11 


4.8 


0.09 


2.9 


1.6 


0.2 


(7) El ec < 1250 MeV 


11 


3.3 


0.06 


1.8 


1.2 


0.2 
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FIG. 24: Distribution of invariant mass M; nv when each 
event is forced to be reconstructed as two photon rings. 

The data are shown as points with error bars 
(statistical only) and the MC predictions are in shaded 
histograms. The last bin shows overflow entries. The 
arrow shows the selection criterion M inv < 105 MeV/c 2 . 



FIG. 25: Distribution of the reconstructed neutrino 
energy spectrum of the events which pass all v e 
appearance signal selection criteria with the exception 
of the energy cut. The data are shown as points with 
error bars (statistical only) and the MC predictions are 
in shaded histograms. The arrow shows the selection 
criterion E™ c < 1250 MeV. 



distributions of the reconstructed vertices of observed v e 
candidate events. As we previously reported, the first 
6 candidate events were clustered near the edge of the 
FV in the upstream beam direction. We observe no such 
clustering in the newly observed 5 events (pink points in 
the figure). All event vertices are x < in the SK coor- 



dinate system which is not related to the beam direction. 
Other T2K neutrino selections with larger event samples, 
such as the CC Vn selection, populate the entire x and y 
region. Figure [28] shows the distribution of distance from 
the ID wall to the vertex along the beam direction for 
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FIG. 26: The cumulative number of observed v e 
candidate events as a function of accumulated POT. 
The vertical dashed lines separate the three running 
periods, and the dotted line indicates the horn current 
change during Run 3. The solid line indicates a 
hypothesis of constant event rate. 



events passing all v e selection cuts except the FV cut. A 
KS test to this distribution yields a p-value of 0.06. In 
addition, a dedicated selection of penetrating particles 
produced in upstream, out-of-FV neutrino interactions 
shows no indication of an excess. 



C. SK efficiency and reconstruction uncertainties 

We have studied the systematic uncertainties on the 
simulation of the SK event selection efficiency and re- 
construction using comparisons of data and MC control 
samples. The error on the FC event selection is estimated 
to be 1%, with a dominant contribution from the flasher 
event rejection. We evaluate the flasher rejection uncer- 
tainty from the difference in the cut efficiency between 
the atmospheric neutrino data and MC simulation. We 
estimate the uncertainty on the fiducial volume definition 
to be 1% by comparing the reconstructed vertex distribu- 
tions of observed and simulated cosmic-ray muons which 
have been independently determined to have stopped in- 
side the ID. We estimate an energy scale uncertainty of 
2.3% from comparisons of distributions between cosmic- 
ray data and simulated samples. These samples include 
the reconstructed momentum spectrum of electrons from 
the decay of cosmic ray muons, cosmic-ray muons which 
stop in SK and have similar energies to the T2K neu- 
trino events, and the reconstructed mass of neutral pi- 
ons from atmospheric neutrino interactions. The error 
on the number of v e candidate events due to the uncer- 
tainty on the delayed, decay-electron tagging efficiency is 
0.2%. We evaluate this uncertainty from a comparison of 
the tagging efficiency between cosmic-ray stopped muon 
data and MC samples. 
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FIG. 27: a) Two-dimensional (x,y) distribution of the 
reconstructed vertex positions of observed v e candidate 

events, b) Two-dimensional (r 2 = x 2 + y 2 , z) 
distribution of the reconstructed vertex positions of the 
observed v e candidate events. The arrow indicates the 
neutrino beam direction and the dashed line indicates 
the fiducial volume boundary. Black markers are events 
observed during Run 1+2, and pink markers are events 
from Run 3. Open squares represent events which 
passed all the v e selection cuts except for the fiducial 
volume cut. 



The remaining uncertainties on the detection efficiency 
are evaluated in categories corresponding to the particles 
exiting the target nucleus. The "CC v e single electron" 
category is comprised of interactions where a single elec- 
tron is emitted and is the only detectable particle in the 
final state. The "CC v e other" category includes all other 
CC v e interactions not in the CC v e single electron cate- 
gory. NC events are also classified based on the particle 
type which exits the nucleus. The "NC single 7r°" cate- 
gory includes events with only one ir° in the detector. 



31 



> 



u 

s 

Z 



Q 



Run 1-3 data 
Signal Pred. 
Background Pred. 
Background: True 
vertex outside ID 




1000 2000 3000 

Distance from ID wall to vertex along beam direction (cm) 



FIG. 28: Distribution of events which pass v e selection 
cuts except for the FV cut as a function of the distance 
from the ID wall to the vertex, calculated along the 
beam direction. The solid line indicates the expected 
distribution for signal (sin 2 20i3=O.l) and background, 
and the background prediction is shown with the dashed 
line. The hatch-filled histogram shows the subset of 
background whose true vertex is outside the ID. 



The topological light pattern of the rings provides the 
information needed to construct quantities used in the 
selection: the number of rings (cut 2), particle identifica- 
tion (cut 3) and the invariant mass (cut 6). We evaluate 
the systematic error on the efficiency of each of the three 
topological cuts on the selection with a fit to SK atmo- 
spheric neutrino data using MC simulation-based tem- 
plates. We create two control samples in the SK atmo- 
spheric neutrino data set which are sensitive to CC v e sin- 
gle electron and CC v e other event types. The v e enriched 
control samples pass the FCFV, E v - ls > 100 MeV crite- 
ria; however the number of decay electrons in the event 
is used to separate QE-like (single ring) from nonQE- 
like (multiple rings) instead of the ring-counting algo- 
rithm. Each control sample is divided into one "core" 
sub-sample, which passes the three topological cuts, and 
three "tail" sub-samples, where events have failed one of 
the three topological cuts. The sub-samples are further 
divided into 17 bins (labeled with index i) in p e and 8 e , 
the reconstructed electron momentum and angle with re- 
spect to the beam direction, so that we can evaluate the 
dependence of the systematic errors on these kinematic 
variables. The expected number of events in all sub- 
samples depends on the efficiency of each topological cut, 
£*= {tiring, £p i D,£inv. mass}, and parameters which rep- 
resent systematic uncertainties on the event rate, a. The 
a parameters include uncertainties on the atmospheric 



neutrino flux normalization, the absolute cross section of 
CC non-QE and NC interactions, the v^jv^ relative cross 
section, and the energy dependence of the CCQE cross 
section. We perform a \ 2 fit to the atmospheric control 
samples, allowing the e and a parameters to vary. 

We extract the uncertainties on the CC v e single elec- 
tron and CC v e other event categories based on the effect 
of the selection cuts on the efficiency e within the fit to 
the control samples. We estimate the bias as the differ- 
ence between the fitted value and the nominal value of 
the event rate for two categories (CC v e single electron 
and CC v e other) over 17 reconstructed (p e , 9 e ) bins. The 
correlations between bins are considered. We also include 
uncertainties on the event categories determined from the 
fit; the fit uncertainties are treated as uncorrelated be- 
tween bins. For the CC v e single electron category, the 
bias is estimated to be 1-9% across all bins, while the 
fit uncertainty is 4-8% across all bins. The bias and fit 
uncertainty for the CC v e other category are 27% and 
14%, respectively; this component is a small contribu- 
tion to the signal and background prediction, and so the 
momentum and angular dependence of the uncertainty is 
ignored. As described later, we use these errors and their 
correlations as inputs for deriving the total SK system- 
atic error on the T2K u e appearance candidate events. 

NC interactions producing a single exclusive photon 
via radiative decays of A resonances (NCI7) are a back- 
ground to the v e appearance signal, as the photon ring 
is very similar to an electron ring. We evaluated the 
difference in the selection efficiency between the single 
photon MC sample and the single electron MC sample 
to estimate the uncertainty on the selection efficiency of 
NCI7 events. The difference in relative efficiencies is no 
larger than 1%, so we assign an additional 1% uncer- 
tainty, added in quadrature to the uncertainty on single 
electron rings estimated from the CC v e single electron 
sample efficiency, as the uncertainty on the selection ef- 
ficiency for NCI7 background events. 

We evaluate the systematic uncertainty for events 
where the muon decays in flight with a MC study. The 
Cherenkov ring of the electron from a muon which de- 
cays in flight tends to be in the same direction as the 
parent muon, and therefore these events look similar to 
CC v e interactions. We estimate the uncertainty on the 
expected number of of muon-decay-in-flight background 
events to be 16%, with the largest contribution from the 
uncertainty on the muon polarization. The fraction of 
muons which decay in flight in the selected v e candidate 
event sample is estimated to be smaller than 1%, and so 
this uncertainty does not contribute substantially to the 
total uncertainty on the v e candidates. 

To evaluate the systematic uncertainty for events with 
a 7T° in the final state, we construct "hybrid- 7r°" control 
samples. The "hybrid-7r°" samples contain events where 
a 7T° is constructed using one simulated photon ring and 
a second electron-like ring from the SK atmospheric or 
cosmic-ray samples. The simulated photon ring kinemat- 
ics are chosen such that the two rings follow the decay 
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kinematics of a ir°. The hybrid samples are constructed 
with electron rings from data (hybrid- 7r° data) and the 
simulation (hybrid- tt° MC), and the comparison of the 
two is used to evaluate the systematic uncertainties. 

We investigate the systematic error coming from the 
higher-energy ring and the lower-energy ring separately. 
The "primary" sample uses electron rings from the SK at- 
mospheric samples, with the electron ring having higher 
energy than the simulated photon ring. In the "sec- 
ondary" sample the electron ring has a lower energy than 
the photon ring. Below 60 MeV, electrons from cosmic- 
ray muons are used; otherwise the electrons from the SK 
atmospheric samples are used. 

We compare the efficiency of the v e selection criteria 
on 7r° events in the hybrid- ir° data and hybrid-7r° MC 
samples in each of the 17 (p e ,9 e ) bins. We apply the effi- 
ciency differences as correlated systematic errors among 
bins, while the statistical errors on the efficiency differ- 
ences are applied as uncorrelated systematic errors. For 
the NC single ir° component, we estimate correlated er- 
rors in each (p e ,9 e ) bin to be between 2-60%, and uncor- 
related errors are between 15-50%. The assigned errors 
are larger in the lower momentum bins, where the ir° se- 
lection efficiency is lower. We evaluate the systematic un- 
certainties on events with one or more charged particles 
above Cherenkov threshold and a tt° by using hybrid- 7r° 
control samples with additional simulated rings for the 
extra particles. 

Finally, we combine all systematic uncertainties on the 
v e appearance signal selection at SK into a single covari- 
ance matrix. The covariance matrix has bins in the ob- 
servable kinematic variables, (p e ,0 e ) or -E£ ec , for the four 
event categories: signal CC v e , background CC u^, CC 
v e , and NC. We use this covariance matrix to model the 
systematic uncertainties on the simulated detector effi- 
ciency and reconstruction in the oscillation fits described 
in Section ITXl 



IX. 



OSCILLATION FIT METHOD AND 
RESULTS 



The v e appearance oscillation signal is an excess of v e 
candidates over background. Table |XIV| and Table |XV| 
show the predicted number of v e candidate events after 
we apply the tuned neut rino flux and cross section pa- 
rameters discussed in Sec. VII If sin 2 2# 13 =0.1, we expect 
11.2 events, and if sin 2 2#i3=0, we expect 3.3. We eval- 
uate the systematic uncertainties on the expected signal 
and background event rates due to the uncertainties on 
the flux model, neutrino interaction cross section model 
and SK reconstruction efficiencies, as summarized in Sec- 
tion EH 

The probability to observe 11 or more events based on 
the predicted background of 3.3 ± 0.4 (syst.) events is 
9 x 10~ 4 , equivalent to an exclusion significance of 3.1cr. 
This rate-only hypothesis test makes no assumptions 
about the energy spectrum of the candidate events or 



their consistency with the neutrino oscillation hypothe- 
sis; it is a statement that we observe an excess of electron- 
like events over background. The background model in- 
cludes expected — > v e oscillation through the solar 
term shown in Eq. [2j which corresponds to 0.2 events. 
If the solar term events are instead considered as signal, 
then the probability decreases to 6 x 10~ 4 . 

We fit the v e candidate sample in the three-neutrino 
mixing paradigm to estimate sin 2 2$i3. The dominant ef- 
fect of a non-zero sin 2 2# 13 is to increase the overall rate 
of v e events. However, spectral information, e.g. electron 
momentum and angle with respect to the T2K beam di- 
rection, (p e ,9 e ), or reconstructed neutrino energy, El ec , 
can be used to further separate the signal from back- 
ground. Fig. 29 shows the area- normalized (p e ,9 e ) dis- 



tribution for the v e candidate events predicted by the SK 
simulation. The signal CC v e are predominantly CCQE, 
and peaked at E v « 0.6 GeV, near the first oscillation 
maximum and neutrino flux peak. This results in a clear 
kinematic correlation across the (p e ,9 e ) distribution for 
signal events. This peak is also visible in the E r v ec dis- 
tribution for signal events, shown in Fig. |30| Conversely, 
the backgrounds to the v e signal populate a wider range 
of kinematic space. The NC backgrounds are predom- 
inantly photons misidentified as electron neutrino can- 
didates, when one photon from 7r° decay is not recon- 
structed, or when the two photons are co-linear. This 
background predominantly populates the low momentum 
and forward angle region as well as the signal region. The 
intrinsic beam v e (z7 e ) backgrounds have a larger contri- 
bution of events at higher energy than the oscillated v ei 
and so more often produce electrons with high momen- 
tum in the forward direction. 

We find that based on studies of the E r v ec and (p e , 9 e ) 
kinematic distributions that the (p e ,9 e ) distribution has 
the best power to discriminate signal and background 
with the minimal cross section model dependence, hence 
we perform a two-dimensional extended maximum like- 
lihood fit to the (p e ,9 e ) data distribution. Section IX B 



describes the (p e ,9 e ) likelihood fit to estimate sin 29 13, 
and Section [IX D| describes two additional fits using E r v ec 
and rate-only information for comparison to the (p ei 9 e ) 
fit. 



is e predicted event rate and systematic 
uncertainties 



The predicted number of v e candidates and the event 
shape distribution depend upon the flux, cross section 
parameters, oscillation probability, and the efficiency and 
resolution of the SK detector. We calculate the predicted 
number of events in a given momentum and angular bin 
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FIG. 29: (p e ,9 e ) distribution for v e signal (top left), background (top right), v e background (middle left), 
background (middle right) and V e background (bottom left). Each distribution is normalized to unit area. 
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FIG. 30: The MC reconstructed neutrino energy 
distributions of Vn — y v e CC signal, intrinsic v e CC 
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candidate event sample. The histograms are normalized 
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(19) 

Here, Tf ■ k ; are the nominal Monte Carlo templates that 
predict the event rate as a function of: 

• momentum/angular bins (i). The momentum bins 
are 100 MeV/c wide from MeV/c to 1500 MeV/c 
(15 in total), and the angular bins are 10° wide 
from 0° to 140° with one bin for 8 e > 140° (15 
in total). The bins are ordered by increasing 9 e in 
groups of increasing momentum. 

• flux type (j) with categories for v e signal, back- 
ground, v e background, background and V e 
background. 
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• true neutrino energy (k) with 200 bins (50 MeV 
wide) from GeV to 10 GeV and one bin from 
10 GeV to 30 GeV. 

• interaction mode (I) with categories for CCQE, 
CCItt, CC coherent, CC other, NCItt , NC coher- 
ent and NC other. 



The 



(b 



systematic parameters ; 
x, d lJjfe ,/ s ). The bj tk 



~norm 
L k,l 



re f = 
vary the flux 



ization parameters. The x are cross section parameters 
such as M® E and pp where the effect on the prediction 
is modeled with response functions, Wij_k,h evaluated 
for each combination of observable bin, flux type, 
neutrino energy bin and interaction mode. The dij ^ 
are systematic parameters that vary the normalization 
of the prediction for each combination of observable bin, 
flux type and interaction mode. These parameters are 
used to model variations due to FSI and SK efficiency 
uncertainties. The momentum scale variation according 
to the parameter f s is not shown in Eq. 19 The 
parameter f s scales the momentum range of the bins 
and the bin contents are recalculated assuming a flat 
momentum dependence in each bin. 

We compute three-neutrino oscillation probabilities, 
PkTm(°)> which include matter effects, according to the 
numerical technique defined in Ref [107] . for a given set 
of the oscillation parameters, o. The Sep dependence is 
evaluated by scanning the value of Sep and fitting for 
sin 2 2# 13 with Sep fixed at each scan point. The remain- 
ing oscillation parameters are always held fixed to the 
values listed in Table IXlTTl 

Based on Eq. [l9j we predict both the total number 
of events and the normalized (p e ,0 e ) shape distribution 
(probability density function, PDF). The predicted num- 
ber of events and the predicted (p e ,9 e ) distribution are 
used in the likelihood function of the oscillation fit. The 
effect of the systematic uncertainties on the predicted 
number of events and (p e ,9 e ) PDF are studied by re- 
calculating the rate and PDF under variations of the 
systematic parameters according to the prior probability 
distribution of the parameters. Table |XVI| summarizes 
the uncertainty on the predicted number of events for 
each systematic error source assuming sin 2 2#i3=0 and 
sin 2 20i3 =0.1. 

The uncertainty on background only predicted num- 
ber of events (sin 2 20i 3 =O) is larger than that of sig- 
nal+background due to the larger uncertainties on the 
NC backgrounds (32%); the uncertainty on CC back- 
ground events (14%) is comparable to that of the CC 
signal events. The inclusion of the ND280 measure- 
ments reduces the uncertainty on the total predicted 
event rate due to the flux and CCQE, CC17T+ cross sec- 
tion model from 18.3% to 8.5% (22.6% to 5.0%), assum- 
ing sin 2 26»i 3 =0. (sin 2 26>i 3 =0.1). The far detector effi- 
ciency uncertainty has been reduced from 14.7% (9.4%) 
in the previous analysis [H] to 6.8% (3.0%) assuming 
sin 2 26»i 3 =0.0 (sin 2 26»i 3 =0.1) due to new CC v e and n° 



TABLE XVI: Summary of the contributions to the total 
uncertainty on the predicted number of events, 
assuming sin 2 20i 3 =O and sin 2 26*i 3 =0.1, separated by 
sources of systematic uncertainty. Each error is given in 
units of percent. 
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SK atmospheric control samples; the FSI uncertainty has 
also been reduced from 10.1% (5.4%) in the previous 
results to 2.9% (2.3%) in this analysis, as correlations 
between reconstructed bins are now taken into account 



(Sec, [yen . 

We also consider the effect on the (p e ,0 e ) PDF, 
or "shape" of (p e ,9 e ), as the systematic parameters 
are changed. Fig. [M] (Fig [32| shows the varia- 
tion of the one-dimensional angular slices of the to- 
tal signal+background as a function of momentum for 
sin 2013=0.1 (sin 2 20 13 =O). The main contributions to 
the shape systematic uncertainties for sin 2 2#i3=0 are the 
SK detector efficiency and W c s parameters in the neu- 
trino interaction models which introduce uncertainties 
on the (p e ,9 e ) distribution of (NC) background. For 
sin 2 26>i3=0.1, the dominant contributions to the shape 
systematic uncertainties are the flux, CCQE and 
CC17T cross section parameters, xsf, and the SK detec- 
tor uncertainties. 



B. i/ e likelihood 

We define an extended likelihood as the product of 
the likelihoods for the observed number of v e candidate 
events (C norm ), the shape of (p e , 9 e ) distribution of those 
events (C shape) and the constraint term for the nuisance 
parameters (C sys t)- The normalization term, C norm , is 
defined by the Poisson probability to observe the number 
of v e candidate events, N obs , given a predicted number 
of events, n = J2if Nf j(o, f): 
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FIG. 31: The PDF as a function of momentum for 
different angular bins (10 of 15 (p e ,9 e ) bins are shown) 
and sin 2 2#i3=0.1. The shaded areas represent one sigma 

deviations that are evaluated by fluctuating all of the 
systematic parameters according to a multivariate normal 
distribution using their prior values and covariance 
matrix. 



FIG. 32: The PDF as a function of momentum for for the 
same bins as in Fig. 



31 



and sin 2#i3=0. The shaded areas 
represent one sigma deviations that are evaluated by 
fluctuating all of the systematic parameters according to a 
multivariate normal distribution using their prior values 
and covariance matrix. 



The shape term, C s h ape is defined by the product of the 
probabilities that each event has a particular value of 
the momentum and angle (p e ,9 e ). We use a Bayesian 
marginalization technique in order to incorporate the 
systematic uncertainties, by integrating over all system- 
atic parameters. Then, the only free parameter in the 
marginalized likelihood is sin 2 2^i 



'13: 



C\o) 



r 



1 

(o, f)xC s 

hape \Pi 

f)xC svst {f)df . (21) 



Here we assume C sys t is a multivariate normal distribu- 
tion of the systematic parameters defined by the param- 
eters' prior values and covariance matrix. The oscillation 
parameters are obtained by maximizing the marginalized 
likelihood. 

We have studied the increase in sensitivity of the anal- 
ysis from the use of kinematic (p e ,8 e ) information and 
from the ND280 fit. The difference of the log likelihood 
at the best-fit and at another value of sin 2 2^13 is calcu- 
lated as: 



2Aln£ 



2 [In C (sin 2 29 13 ) 

ibes 
'13 



ln£'(sin 2 26<i| st )] 



(22) 



The likelihood in — 2Aln£ can include just the normal- 
ization term, or the normalization and shape term, and 



the systematic term in the likelihood can include the 
ND280 measurements or not. Fig. [33] shows the average 
— 2Aln£ curves for these three cases, for toy MC data 
generated at sin 2#i3=0.1. We obtained a 20% improve- 
ment to — 2Aln£ at sin 2 2#i3=0 when kinematic infor- 
mation is included; this is equivalent to a 20% increased 
beam exposure. Similar studies show a comparable in- 
crease of 19% for the use of ND280 information in the 
likelihood to reduce the systematic errors. 



C. Results for sin 2 2^13 

We performed the fit to the observed 11 v e candidate 
events by allowing sin 2 2^13 to vary and scanning the 
compares the (p e ,8 e ) kinematic 



value of Sep- Fig. 34 



distributions observed in data with the prediction at the 
best-fit value of sin 2 26*13 . 

Because of the potential bias in the determination of 
sin 26*13 near the physical boundary of sin 2013=0, we 
calculate the confidence intervals following the Feldman- 

The 68% and 90% confi- 



Cousins (FC) method [108] 
dence intervals calculated using the FC method and con- 
stant — 2Aln£ method are found to be equivalent. As- 
suming Scp—0, the best-fit values of sin 2 2#i3 with the 
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FIG. 33: The — 2Aln£ average sensitivity curve for toy 

MC data generated at sin 2 20i 3 =O.l with Scp=0, 
normal hierarchy and 3.01 x 10 20 POT. The likelihood 
is shown for three cases: where rate, shape and ND280 
information is used, where only rate and ND280 
information is used, and where rate and shape 
information is used without ND280 information. 



68% confidence intervals are: 



sin 2 26>i 3 = 0.0881^ (normal hierarchy) 
sin 2 26>i 3 = 0.108t£;^ (inverted hierarchy) 

The 90% confidence intervals are: 

0.030 < sin 2 26»i3 < 0.175 (normal hierarchy) 
0.038 < sin 2 26>i 3 < 0.212 (inverted hierarchy). 



Fig. 



35 shows the 68% and 90% confidence intervals for 
26*13 and the best-fit sin 2 26'i3 for each value of Sep- 

8 e ) distri- 
0, we per- 



sm zci3 

To compare the data with the best-fit (p e 
bution, assuming normal hierarchy and 5c p- 
form the Kolmogorov-Smirnov (KS) test. We reorder the 
2D (jp e ,9 e ) distribution into a ID histogram, and gener- 
ate 4000 toy MC experiments with the input value of 
sin 2 26*13 = 0.088 (best-fit value) and where the observed 
number of events is 11. We then calculate the maximum 
distance for each toy experiment and determine the frac- 
tion of toy experiments for which the maximum distance 
is equal to or more than 0.22, the value obtained for a KS 
test done on data. The p-value is 0.54 and therefore the 
(p e , 9 e ) distribution of data is consistent with the best-fit 
distribution. 

Fig. [35] shows the — 2Aln£ curve as a function of 
sin 2 26 , 13 , for Scp=0- We consider an alternate test of 
the background hypothesis using the value of — 2Aln£ 
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FIG. 34: The (p e ,0 e ) distribution of the v e events 
(dots) (top) overlaid with the prediction. The 
prediction includes the rate tuning determined from the 
fit to near detector information and a signal assuming 

the best-fit value of sin 2 26 , i 3 =0.088. The angular 
distribution (middle) of the v e events in data overlaid 
with prediction, and the momentum distribution 
(bottom) with the same convention as above. 



at sin 26 ) 13 =0. The probability of obtaining a — 2Aln£ 
at sin 2 26>i3=0 equal to or greater than the value ob- 
served in data, 8.8, is calculated using the distribution 
of — 2Aln£ from pseudo-experiments generated with 
sin 2 26'i3=0, Scp=0, normal hierarchy and fitted with 
the signal+background model. This test makes use of 
the different (p e ,9 e ) distributions of signal compared to 
background, assuming three active neutrino mixing, and 
yields a similar probability of 1 x 10~ 3 to the rate-only 
test presented earlier. 
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FIG. 35: The 68% and 90% confidence intervals for 
sin 2 20i3 scanned over values of Sep assuming normal 
hierarchy (top, b) and inverted hierarchy (bottom, d) 
with all o ther o scillation parameters fixed at the values 
in Table XIII The best-fit value of sin 2 26> 13 for each 
value of 6c p is also shown for the (p e ,6 e ) analysis. The 

— 2Aln£ curve for normal hierarchy (top, a) and 
inverted hierarchy (bottom, c) at Scp=0 are also shown 
vs. sin 2 2#i3. 



D. Alternate analysis methods 

In addition to the (p e ,6 e ) analysis, we performed an 
analysis using the reconstructed neutrino energy spec- 
trum, and a rate-only analysis. Since E r v &c is closely cor- 
related to the true neutrino energy for QE interactions, it 
provides the simplest projection for observing the energy 
dependence of the oscillation probability. This analysis 
also provides a consistency check of the use of spectral 
information in the fit. We also provide an update to the 
previous z/ e appearance analysis [21], where only rate in- 
formation was used. 

The likelihood including neutrino energy spectrum in- 
formation is defined as: 

£(o,/) = C norm (o,f) x C shape {o,f) x £ syst (f) (23) 

In this analysis, we perform a one dimensional scan of 
sin 2 26' 1 3 for each value of Sep while the other oscillation 
parameters are fixed. At each sin 2 26>i3 point, the nega- 
tive log likelihood — 21n£(o, /) is minimized by allowing 
the nuisance parameters, /, to vary. The best-fit value of 
sin 2 2#i3 is the point where — 21n£(o) is minimized and 
— 2 A ln£ is used to constructing a confidence interval for 
sin 2 26*13 according to the FC method. 

Fig. 36 shows the observed El ec distribution for the u e 
events with the best-fit of the E r v ec analysis applied. The 
observed spectrum agrees with the best-fit expectation, 
confirmed by a KS test with a p- value of 0.7. The best-fit 
values of sin 2 26 13, assuming Sep = 0, are: 



sin 2 26>i 3 
sin 2 26»i 3 



092 + 049 

U.UJZ _q 039 



0.112 



+0.058 
-0.047 



(normal hierarchy) 
(inverted hierarchy) 



The 90% confidence intervals are: 



0.033 < sin 2 26 13 < 0.179 
0.040 < sin 2 26»i3 < 0.215 



(normal hierarchy) 
(inverted hierarchy). 



The rate-only measurement only uses the number of u e 
events at SK to determine sin 2 26*13. This analysis uses 
the normalization likelihood ratio: 



A X 2 = -2 log 



r 



'"norm \ U J J I 



(24) 



where C norm is defined in Eq. [20j The value of A\ is 
calculated for the 11 observed v e candidates, in a one 
dimensional scan of sin 2 26*i3 for each point of 5cp with 
all other oscillation parameters fixed. The confidence 
intervals are determined using the FC method. 

The best-fit values of sin 2 26*13, assuming 8c p = 0, are: 

sin 2 26*i3 = 0.097±g:g|f (normal hierarchy) 

sin 2 26»i 3 = 0.123^;^ (inverted hierarchy). 

The 90% confidence intervals are: 

0.034 < sin 2 26>i 3 < 0.190 (normal hierarchy) 
0.044 < sin 2 26>i 3 < 0.236 (inverted hierarchy). 
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FIG. 36: The observed E T v ec distribution and prediction, 
assuming sin 2 26>i3=0.092, Sep = 0, and normal 
hierarchy. The background component is also shown. 



Fig. 



37 shows the three analyses are consistent with 
The rate-only analysis has a higher best-fit 
than the E r ^ c , (p e ,6 e ) analyses. This 



each other, 
value of sin 2 26> 13 
results from the additional discriminatory power of the 
kinematic information to identify events as slightly more 
similar to the background distribution than the predicted 
oscillation signal. In addition, the difference between the 
best-fit and the 90% upper confidence interval for the 
rate-only analysis is larger than the other two analyses. 
This is due to a slight (2%) over-coverage of the rate-only 
analysis. 



X. 



CONCLUSION 



In summary, we have reported the first evidence of 
electron neutrino appearance in a muon neutrino beam 
with a baseline and neutrino spectrum optimized for the 
atmospheric mass splitting. We observed 11 candidate 
v e events at the SK detector when 3.3 ± 0.4(syst.) back- 
ground events are expected, and rejected the background- 
only hypothesis with a p-value of 0.0009, equivalent to a 
3.1cr significance. We have employed a fit to the ND280 
near detector data that constrains the parametrized neu- 
trino flux and interaction models used to predict the 
event rates at SK. The ND280 constraint on the v e candi- 
dates reduced the overall systematic uncertainty to 10- 
13% depending on the value of sin 2 2#i3, an important 
step towards precision measurements of v e appearance. 
The excess of events at SK corresponds to a best-fit value 
of sin 2 26»i 3 = 0.088±g;g|| at 68% C.L., assuming S C p=0, 
sin 2 2#23=l-0 and normal hierarchy. 

This result represents an important step towards con- 
straining the unknown parameters in the three-neutrino 
oscillation model. The evidence of electron neutrino ap- 
pearance opens the door for a rich program of exper- 
imental physics in this oscillation channel. T2K mea- 
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FIG. 37: The 68% and 90% confidence interval regions 

for sin 2 20i3 scanned over values of 5c p assuming 
normal hierarchy (a) and inverted hierarchy (b) with 
the best- fit value of sin 2 26*i3 shown for the (p e , d e ) 
analysis. The 90% confidence interval region for the 
E r v ec analysis and rate-only analysis are overlaid. The 
best-fit values of sin 2 2#i3 for the E r v ec analysis and the 
rate-only analysis are also shown. All other oscillation 



parameters are fixed at the values in Table XIII 



surements of this channel will be an important input to 
global fits which also combine muon neutrino disappear- 
ance measurements and reactor-based measurements of 
6>i3 via v e disappearance to begin to constrain Sep and 
the octant of #23- Future measurements of the appear- 
ance probability for antineutrinos will provide a further 
constraint on 5c p and the mass hierarchy. 
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ternal data covariance matrices use observable bins from 
external data, such as reconstructed (7),, cos bins for 
MiniBooNE. The ND280 covariance matrix corresponds 
to the two ND280 selections' reconstructed (p^cosf^) 
bins and the SK covariance matrices correspond to the v e 
selection with either reconstructed (p e ,6 e ) or E r v ec bins: 



1 



% = ^E(pr m -ft fc )(pf m -^ 
fe=i 



(Al) 



where p\ is the expected event rate in the ith observable 
bin assuming the fcth FSI parameter set, and pf° m is the 
expected event rate in the same bin assuming the nom- 
inal FSI parameter set. For the oscillation analysis, we 
add these FSI covariance matrices to the detector effi- 
ciency and reconstruction covariance matrices evaluated 



for ND280 (Section |VTC[ ) and SK (Section |VHI| ) selec- 
tions. 



FIG. 38: Pion absorption cross section as a function of 
pion momentum overlaid with 7r + - 12 C scattering data, 
Ashery et al. [77] . Jones et al. [78], and 
Giannelli et al. [751. 



Appendix A: NEUT model external data 
comparisons and tuning 

We fit external pion scattering data and neutrino scat- 
tering data with the NEUT model while allowing subsets 
of the systematic parameters described in Table |IV| to 
vary. These fits constrain the NEUT FSI, CCQE and 
resonant pion production models. The details of these 
fits are described here. 



a. FSI model 

The NEUT FSI model includes parameters which alter 
the pion interaction probabilities for absorption, charge 
exchange, and quasi-elastic scattering [75] . The values of 
these parameters and their uncertainties are determined 
from fits to pion scattering data. Fig. [38] shows the tuned 
cascade model compared to macroscopic measurements 
of the pion absorption cross section and the maximum 
variation of the model parameters chosen to cover the 
uncertainties on the data. 

In total, we consider 16 variations of the FSI model pa- 
rameters to cover the uncertainties on macroscopic pion 
scattering data. For each of the modified FSI parame- 
ter sets and the nominal NEUT model, we evaluate with 
weights the effect on ND280, SK or external predicted 
observables by calculating the covariance matrix of the 
predicted observables. FSI covariance matrices are gener- 
ated for MiniBooNE, ND280 and SK predictions. The ex- 



b. CCQE model 



As discussed in Section |V C 2[ we fit the Mini- 
BooNE measurement of the CCQE double-differential 
cross sections in bins of muon kinetic energy and angle, 
(T M , cos 6> M ) |80, with the NEUT model. While the CCQE 
model can be directly constrained with T2K ND280 data, 
we also fit the MiniBooNE measurement since the Mini- 
BooNE detector's 47r acceptance provides coverage for 
backwards produced muons that are currently excluded 
in the ND280 selection. 

To compare the NEUT model of CCQE interactions 
with MiniBooNE data, we use the MiniBooNE flux pre- 
diction |110) to generate CCQE interactions. We fit the 
MiniBooNE double-differential cross section data with 
the NEUT prediction, allowing M% E and the overall 
cross section normalization to vary, by minimizing the 
X 2 defined as: 



( 2 (Mf,A) 



N 

E 

i=0 



'Di-XMi(M2 E ) 



A - 1 



(A2) 

Here, the sum runs over the N bins in the (T„, cos 9^) 
differential cross section, Di is the cross section measured 
by MiniBooNE in the ith bin, M; is the NEUT prediction 
in that bin and Ui is the reported shape-only component 
of the error on the measured cross section. The second 
term adds a penalty to the normalization parameter A, 
which is constrained within the MiniBooNE flux uncer- 
tainty, o~\ = 10.7%. The best-fit parameter values are 
M% E = 1.64 ± 0.04 GeV and A = 
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0.88 ±0.02. Fig_ 
shows the measured MiniBooNE cross section as a func- 
tion of Q 2 for the nominal and best-fit value of , 
which is well reproduced except at lowest values of Q 2 . 
However, this value of M? B is significantly larger than 
the value of M% E = 1.35 ± 0.17 GeV obtained by the 
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FIG. 39: The CCQE cross section as a function of Q 2 
(top) as measured by MiniBooNE (points), with the 
NEUT nominal and NEUT at the best-fit of the 
MiniBooNE CCQE (T M , cos 6*^) spectrum. Ratio of data 
to NEUT (bottom) for nominal (dashed) and best-fit 
(solid). 
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FIG. 40: The CCQE cross section as a function of 
neutrino energy (top) as measured by MiniBooNE 
(points), with the NEUT nominal and NEUT at the 
best-fit of the MiniBooNE CCQE (T M ,cos0 M ) spectrum. 
Ratio of data to NEUT (bottom) for nominal (dashed) 
and best-fit (solid). 



MiniBooNE collaboration in a fit to the single-differential 
da/dQ 2 spectrum, with an uncertainty that is smaller by 
a factor of 4. We postulate that the difference in central 
values is due to deficiencies in the nuclear model at low 
Q 2 , which MiniBooNE addressed by adding an empirical 
parameter n to modify Pauli blocking, and the lack of 
full correlations between the measured (7),, cos # M ) bins 
which are not included in the provided uncertainties. We 
assume the lack of bin correlations also causes the dis- 
crepancy in the fitted uncertainty, and this is supported 
by the relatively small x 2 = 26.9 that is observed for 137 
degrees of freedom. Furthermore, the fitted prediction 
for the total CCQE cross section as a function of energy 
is poor, as illustrated in Fig. 40 The fitted model is 



systematically higher than the MiniBooNE data above 
1 GeV, although agreement is improved near the T2K 
peak energy of 600 MeV. 

As is discussed in Section |VT} a CCQE- like selection of 
interactions in ND280 has power to constrain the CCQE 
cross section model. Since the fit to MiniBooNE data 
poorly reproduces the energy dependent cross section and 
lacks the full correlation of data points, we do not di- 
rectly tune the NEUT model with the fitted value for 
■ Instead, we set large prior uncertainties on the 
CCQE model parameters and allow the ND280 data to 
constrain the model. We set M® E to the NEUT nominal 
value (1.21 GeV), with the prior uncertainty set to the 
difference between the nominal value and best-fit value 



from the MiniBooNE fit, viz. (1.64 - 1.21 = 0.43) GeV. 
We set the uncertainty on the low energy CCQE normal- 
ization, xf E , to the size of the MiniBooNE flux uncer- 
tainty (11%). 



c. Single pion production model 



As discussed in Section IV C 31 we consider measure- 
ments of single pion production cross sections on light 
nuclei in the T2K energy range by MiniBooNE [87H89] . 
and K2K [5U]. We perform a joint fit to the MiniBooNE 
measurements of charged current single ir + production 
(CCl7r + ), charged current single n° production (CCl7r°) 
and neutral current single ir° production (NCl7r°), and 
we check the fit results with the K2K measurement. 

An important feature of the MiniBooNE single pion 
measurements is that they are defined by the particles 
exiting the target nucleus, not the particles produced at 
the neutrino interaction vertex. The measurements do 
not include corrections for FSI, but do include uncertain- 
ties of interactions of the pions in the detector. To de- 
rive the NEUT predictions for these selections, we gener- 
ate interactions according to the MiniBooNE flux as was 
done for the CCQE fits. Instead of selecting generated 
events based on the true neutrino interaction mode, such 
as CC17T+, we select the events based on the presence of 
a single pion exiting the nucleus. Hence, multiple inter- 
action types are present in the prediction for each of the 
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MiniBooNE measurements. For example, CCl7r + inter- 
actions chiefly result in a single charged pion exiting the 
nucleus, but these events may instead pass the CCi7r° se- 
lection if 7r + undergoes single charge exchange within the 
nucleus. This interdependence within the MiniBooNE se- 
lections, as well as the fact that all three are predicted 
by the same model in NEUT, justifies the use of a joint 
fit to the three measurements. 

We fit to the measured da/dQ 2 spectra from CCl7T + 
and CC17T samples and the da/dp n o spectrum from the 
NCl7r° samples. MiniBooNE provides uncertainties for 
each of the measurement. In the case of the CCl7r° and 
NCl7r° measurements, covariance matrices account for 
correlations between the measured points in the spectra 
arising from the MiniBooNE flux model and detector re- 
sponse. MiniBooNE only provides diagonal errors for the 
CCl7r + measurement. We construct a covariance matrix 
for the CCl7r + by assuming a 10% flux uncertainty cor- 
related across all bins and by adding an additional un- 
corrected uncertainty to the diagonal terms to recover 
the diagonal errors provided by MiniBooNE. While the 
flux is shared for the three measurements, at this time 
no correlation between the three measurements was con- 
sidered. 

For each of the three measured distributions (fc) we 
construct the x 2 based on the data and NEUT prediction: 



xi = EE^ - Mtmc^r 1 ^ - (A3) 



Here, i and j sum over the bins in the fcth measurement, 
D\ are the measured differential cross sections, is 
the covariance matrix describing the uncertainty on the 
measurement and Mf{x) are the NEUT predictions for 
each measurement. 

The cross section parameters that are allowed to 
vary in the fit, x, along with their prior values and 
prior uncertainties for penalty terms are listed in Ta- 
ble [Vj Contributions to the predictions from CC multi- 
pion/DIS (xcc other) interactions, NC coherent interac- 
tions, NCItt* interactions and NC multi-pion/DIS inter- 
actions are relatively small, so penalty terms are used for 
the associated parameters according to the prior uncer- 
tainties. 

We minimize the total \ 2 that includes the x 2 for each 
of the measurements and the penalty terms: 



Xtotal — XcC1it+ + XcCl-rr + XnCItt + 



E 



(A4) 

where, for each penalized parameter k, s k is the value of 
the parameter, s k wm is the nominal value, and a k is the 
prior uncertainty assigned to the penalty parameter. 



In practice, the inclusion of the NCl7r° covariance 
matrix in the fit results in a best-fit which lies out- 
side the range of the data points. This behavior results 
from strongly-correlated measurements combined with a 
model which does not correctly describe the data llllj . 
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FIG. 41 : Differential da/dp^o cross section measured by 
K2K and the nominal and best-fit from the MiniBooNE 
single pion fits NEUT predictions, with error band 
showing the uncertainties after the fit to MiniBooNE 
data. 



To achieve a fit that better reproduces the central values 
of the data points, we only use the diagonal terms of the 
NCl7r° covariance matrix in our fit. The missing corre- 
lations also result in uncertainties on the fit parameters 
which do not cover the uncertainties in the data points. 
To remedy this, we multiply the fit parameter uncertain- 
ties by a scale factor of 2 (2.5) for CC (NC) parameters, 
while keeping their correlations the same. These scale 
factor ensure that the flux-integrated cross section un- 
certainty matches that given by MiniBooNE (16% for 
each measurement). 

The results of the fit are discussed in Section fV C 31 We 
propagate the fitted values and uncertainties for M^ ES , 
x ccin an( ^ x ncitt° £ mo del the cross section in the fit 
to ND280 data described in Section [Vll[ In addition, we 
keep parameter W e s at its nominal value, but apply an 
uncertainty equal to the amount it is pulled in the fit to 
the MiniBooNE data. 

We compare the results of the fitted NEUT pion 
production model to the NC K2K measurement. The 
da/dp w o distribution measured by K2K in the 1000 ton 
water Cherenkov detector is shown with the nominal and 
tuned NEUT model in Fig. [44] As with the MiniBooNE 
data, the data prefer a peak at higher momentum and 
fewer events in the high momentum tail compared to the 
nominal NEUT prediction. The use of NEUT assuming 
the best-fit parameters from the MiniBooNE single pion 
production fits does not significantly improve the agree- 
ment between NEUT and the K2K data. However, the 
discrepancy is covered by the uncertainties on the single 
pion production and FSI model. 



